HomeJobsOkta › Tech Ops-610

Staff Site Reliability Engineer - Observability

Okta Bellevue, Washington; New York, New York; San Francisco, California; Washington, DC Full-time Tech Ops-610 Posted Jun 1, 2026
Apply Now →

What it’s like to work at Okta

Identity & Security · San Francisco

3.7
Employee Rating
3.8
Work-Life Balance
342
Open Roles
learningequityproduct-impactdiverse

What employees love

  • Dynamic Work policy — genuinely flexible schedules with no mandatory office days
  • Strong comp and benefits rated 4.0/5 — no-meeting Fridays and competitive equity

What could be better

  • Recent layoffs and restructuring have shaken trust in job security
  • Career opportunities rated 3.4/5 — growth paths could be clearer at this scale
View full Okta culture profile →

About the Role

Secure Every Identity, from AI to Human

Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Position Overview:

We are seeking a highly technical Staff Observability Site Reliability Engineer with a specialty in Splunk to own and evolve our Splunk ecosystem. In this role, you will move beyond simple monitoring to delivering a world class, comprehensive, scalable Observability Platform that enables our SRE teams and business partners. You will treat infrastructure as code—utilizing Terraform and strong coding proficiency in Go, Python, or Ruby—to automate the deployment of agents and collectors across complex distributed systems.

Key Responsibilities

  • Automated Infrastructure: Design, build, and maintain scalable observability infrastructure using tools like Terraform.
  • Splunk Engineering: Optimize the collection, processing, and storage of log data to ensure high reliability and low latency of our Splunk services
  • Incident Response: Participate in on-call rotations and lead post-incident reviews to drive systemic improvements and "observability-driven development."
  • Automation: Eliminate "toil" by automating the deployment and scaling of observability agents and collectors.

Required Skills & Experience (The Essentials)

Log Management: Minimum 5+ Experience scaling and managing Splunk Cloud at scale (1000+ SVCs), including Workload Management (WLM) and HEC optimization. Visualization: Expertise in creating intuitive, actionable Splunk dashboards that correlate data across multiple sources.
SRE Mindset: Minimum 5+ years of experience in an SRE, DevOps, or Systems Engineering role with a focus on high-availability systems.

  • Programming Proficiency: Strong coding skills in SPL, Go for building internal tools and automating workflows.
  • Distributed Systems: Deep understanding of Linux internals, networking (TCP/IP, DNS, Load Balancing), and container orchestration (Kubernetes/EKS).
  • Problem Solving: A data-driven approach to debugging complex, cross-service performance bottlenecks.

Bonus Skills (The "Nice-to-Haves")

  • Telemetry Standards: Hands-on experience with OpenTelemetry (OTel), Vector, or similar frameworks for instrumenting applications.
  • Charge-back app: Experience in implementing Splunk charge-back app for usage reporting

Cloud Platforms: Experience managing observability native tools within AWS or GCP.

Additional requirements:

  • This position requires the ability to access federal environments and/or have access to protected federal data. As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.
  • This person must attend in person onboarding in our San Francisco office the first week of employment.

#LI-MM

#LI-Hybrid
P14596_3372199

Below is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: https://rewards.okta.com/us.

The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York, and Washington is between:
$194,000$267,000 USD


The Okta Experience

We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.

Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please click here to view our full NYC AEDT Notice.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at https://www.okta.com/legal/personnel-policy/.

Similar Roles

More at Okta
Senior Manager, Site Reliability Engineering (Federal)
Washington, DC
Senior Manager, Site Reliability Engineering - Infrastructure Platform
San Francisco, California
Senior Site Reliability Engineer (Auth0)
Toronto, Ontario, Canada
Senior Site Reliability Engineer (Auth0)
Barcelona, Spain
SRE Operations Engineer
Bengaluru, India
Similar roles at other companies
Member of Technical Staff (Software Engineer, Monetization)
Perplexity AI · San Francisco
Staff Software Engineer, Product
Replit · Foster City, CA
Member of Technical Staff - Systems
Modal · New York
Staff Software Engineer, Frontend
Suno · Boston
Staff Data Engineer
Vanta · Remote U.S.

Frequently Asked Questions

What is the work-life balance like at Okta?
Okta has a work-life balance score of 3.8/5 based on employee reviews. This is about average for the AI/tech industry.
What is Okta’s culture like?
Okta is characterized by these culture values: learning, equity, product-impact, diverse. Based on employee reviews, the company has an overall rating of 3.7/5. Dynamic Work policy — genuinely flexible schedules with no mandatory office days
How many open roles does Okta have?
Okta currently has 342 open roles across departments including engineering, product, sales, and more. Roles are refreshed daily from their careers page.
Is this role remote-friendly?
This role is located in Bellevue, Washington; New York, New York; San Francisco, California; Washington, DC. Check the job description above for specific location and remote work details.
Apply for this role at Okta →