Staff Software Engineer (DevOps)
Kaseya
We do not know your resume yet
Upload your resume to unlock your actual match score and identify important JD keywords before applying.
Recruiters may search these ATS Keywords in your resume
Keywords
Job Description
Kaseya is the leading provider of AI-powered IT management and cybersecurity software, serving Managed Service Providers (MSPs) and internal IT organizations worldwide. Our comprehensive platform helps organizations efficiently manage, secure, and automate their IT environments, driving operational efficiency and long-term business success.
Backed by Insight Partners, a leading global software investor, Kaseya has experienced sustained double-digit growth and continues to expand its global footprint. Today, Kaseya supports customers in more than 20 countries and manages over 15 million endpoints worldwide.
Founded in 2000, Kaseya has built a culture centered around innovation, accountability, and results. We are a high-growth, high-performance organization that values individuals who are driven, adaptable, and committed to delivering exceptional outcomes for our customers and teammates alike.
At Kaseya, success comes from embracing challenges, moving with urgency, and continuously raising the bar.
About the Role:
- Kaseya's SaaS Backup products protect M365, Google Workspace, and Salesforce data for hundreds of thousands of businesses globally.
- We are building the next generation of our backup storage platform — a unified, cloud agnostic system built to operate at massive scale across trillions of objects and petabytes of data.
- As a Staff Engineer, you will own one or more platform services end to end: design, implementation, testing, and production readiness. You work with high autonomy on well-scoped problems and are the go-to technical owner for your domain.
- Own the full lifecycle of a platform service — from design through production deployment and on call support
- Build reliable, well tested backend services in Go with a strong focus on correctness and operational simplicity
- Design and implement integrations with SaaS APIs (M365, Google Workspace, Salesforce) handling rate limits, delta sync checkpointing, and failure recovery
- Collaborate closely with principal and senior engineers on cross service interfaces and data models
- Write high quality code that sets the standard for the engineers around you
- Participate in design reviews and contribute to architectural decisions within your domain
- 8–12 years of DevOps / SRE experience, with at least 3 years managing large-scale infrastructure
- Kubernetes — cluster operations, resource management, custom controllers, multi-tenant workload isolation
- Infrastructure as Code — Terraform or Pulumi at production scale; versioned, modular, reusable
- CI/CD pipeline ownership — designing and maintaining pipelines (GitHub Actions, Jenkins, ArgoCD or equivalent)
- Observability stack — metrics, logs, and traces in production (Prometheus, Grafana, Datadog or equivalent); defining SLOs/SLAs, not just dashboards
- Incident management at scale — structured on-call, alert triage, runbooks, post-mortems; experience reducing alert noise (1K+ alerts/month environment)
- Networking fundamentals — DNS, load balancing, firewalls, VPC/overlay networks in hybrid environments
- Security & compliance mindset — secrets management (Vault), RBAC, image scanning, audit logging; critical for a backup product handling customer data
- Scripting proficiency — Go or Python for automation; shell scripting for ops tooling
- Linux systems depth — performance tuning, kernel parameters, storage I/O, process management at scale
- OpenStack operations — managing Nova, Swift, Neutron, Cinder at scale
- Multi-cloud abstraction — managing workloads across AWS, GCP, Azure and private cloud with consistent tooling
- Large-scale infrastructure (5K+ nodes) — capacity planning, hardware lifecycle, rack-level failure domains
- Cost optimization / FinOps — cloud spend analysis, rightsizing, storage tiering strategies
- Chaos engineering — fault injection, game days, resilience testing (Chaos Monkey, Litmus)
- Bare metal provisioning — PXE boot, IPMI, automated OS provisioning at scale (Ironic, MaaS)
- Backup/DR domain awareness — understanding RPO/RTO, storage replication, data protection pipelines
- Go proficiency — reading and debugging Go services, contributing to internal tooling
- High ownership over a real production service from day one
- Greenfield platform with strong technical leadership and a clear roadmap
- Small team where your contributions are visible and impactful
- Competitive salary and benefits
Kaseya provides equal employment opportunity to all employees and applicants without regard to race, religion, age, ancestry, gender, sex, sexual orientation, national origin, citizenship status, physical or mental disability, veteran status, marital status, or any other characteristic protected by applicable law.
About The Company
Kaseya
Kaseya is the leading global provider of AI-powered cybersecurity and IT management software. Through its customer-centric approach and renowned support, Kaseya delivers best-in-breed technologies that empower organizations to seamlessly manage IT infrastructure, secure networks, backup critical data, manage service operations and grow their businesses. Kaseya offers a broad array of IT management solutions from industry-leading providers: audIT, ConnectBooster, Datto, Graphus, ID Agent, IT Glue, Kaseya, RapidFire Tools, RocketCyber, Secure Payments, Spanning Cloud Apps, TruMethods, Unitrends and Vonahi. These innovative solutions fuel Kaseya’s IT Complete platform, which addresses the challenges of multifunctional IT professionals. IT Complete empowers them to centrally command hardware, software, security, data, compliance, operations and more from within a comprehensive, integrated, intelligent (AI utilization-optimized), and affordable platform. Headquartered in Miami, Florida, Kaseya is privately held with a global presence in more than a dozen countries.
How to Apply Better for This Job
This section explains the correct next step without forcing sign-in immediately.
Check ATS score before applying
Scan your resume for ATS readability, formatting issues, missing sections, weak keywords, and content gaps.
Customize your resume for this JD
Match your resume with the job description and add ai , go , aws , dns , gcp , keywords where they fit naturally.
Find similar jobs too
Do not depend on one opening. Use your resume to find similar frontend jobs across relevant job platforms.
Ready with your customized resume?
Once your resume includes the right skills and is ATS-friendly, you can apply directly on the source platform.
Market Insights:Best Programming Analyst Jobs in India
Find the latest Programming Analyst jobs across top Indian cities. Compare job counts by location and apply where hiring demand is higher.