Full-time Site Reliability Engineer openings in Los Angeles on September 10, 2022

Principal Site Reliability Engineer – REMOTE at OCTO CONSULTING GROUP

Location: Los Angeles

Octo is an industry-leading, award-winning provider of digital services for the federal government. Octo specializes in providing agile software engineering, user experience design, cloud services, and digital strategy services that address government’s most pressing missions. Octo delivers intelligent solutions and rapid results, yielding lower costs and measurable outcomes.

Our team is what makes Octo great. At Octo you’ll work beside some of the smartest and most accomplished staff you’ll find in your career. Octo offers fantastic benefits and an amazing workplace culture where you will feel valued while you perform mission-critical work for our government. Voted one of the region’s best places to work multiple times, Octo is an employer of choice!

Job Description

You

As a Site Reliability Engineer at Octo, you shall be able to build and maintain infrastructure as code on large scale multi-site deployments. The SRE shall utilize their experience to evaluate and assess new ways to scale platform capabilities. The SRE shall be able to automate workflows to help push the limit of the infrastructure and enable continuous delivery of capabilities onto a hybrid infrastructure. The engineer shall be able to troubleshoot issues until root causes are understood on high traffic production systems, participate in design and code review processes, interact with product owners to coordinate infrastructure changes and be responsible for identifying bottlenecks and improving performance of the platform.

Us

We were founded as a fresh alternative in the Government Consulting Community and are dedicated to the belief that results are a product of analytical thinking and agile design principles, and that solutions are built in collaboration with, not for, our customers. This mantra drives us to succeed and act as true partners in advancing our clients’ missions.

Program Mission

Provide macro-level architectural solutions of a multi-region hybrid infrastructure (on-prem and commercial cloud), a cross-cutting application and integration architecture, and a data management and analytics architecture. This effort will provide the Air Force with feature strategic team members integrated with the Kessel Run teams to support a culture of continuous improvement across the teams. Focus on providing DevSecOps software engineering expertise, Kubernetes infrastructure, and multi-region platform services that help modernize legacy software systems and support external DoD software initiatives.

Skills & Requirements

What we’d like to see
• Five-plus (5+) years building and maintaining Kubernetes clusters across hybrid-cloud infrastructure
• Eight-plus (8+) years of experience working in Operations, DevOps, or Site Reliability Engineering
• Five-plus (5+) years of configuration/package management experience using tools like Terraform, Helm, etc.
• Five-plus (5+) years of experience with cloud service monitoring tools like Prometheus, Grafana, FluentD, ElasticStack, SumoLogic, etc.
• Exceptionally proficient (knowledge and work experience) in Linux system administration
• Ability to assist with GitLab CI pipelines (build/promote artifacts and security scans)
• Experience creating automation using APIs from Azure or Google Cloud
• Three-plus (3+) years of experience with infrastructure and service monitoring tools like Prometheus, Grafana, FluentD, ElasticStack, SumoLogic, etc.

Desired Skills:
• Effective communication skills to interact with various stakeholders internal and external to the organization.

Years of Experience: 8 years of experience or more

Education: Bachelor’s Degree in a Technical Discipline – Computer Science, Mathematics, or equivalent technical degree

Clearance: U.S. Citizenship required, DoD Secret or higher preferred

Octo is an Equal Opportunity/Affirmative Action employer. All qualified candidates will receive consideration for employment without regard to disability, protected veteran status, race, color, religious creed, national origin, citizenship, marital status, sex, sexual orientation/gender identity, age, or genetic information. Selected applicant will be subject to a background investigation.

********

Senior Site Reliability Engineer – Storage – Remote at Akamai Technologies

Location: Los Angeles

We design, deploy, and manage applications and infrastructure that support Akamai’s internal and customer-facing cloud storage platforms. We do this while maintaining Akamai’s mission to make life better for billions of people, billions of times a day.

As a Senior Site Reliability Engineer – Storage, you’ll collaborate with operations and development teams to build and manage our scalable storage platforms, including Block Storage, Object Storage, and backups. You’ll create tooling to automate the lifecycle of petabyte-scale storage systems. You’ll work with open-source technologies, including Ceph, to ensure Akamai’s storage systems are reliable, available, and performant.

As a Senior Site Reliability Engineer – Storage, you will be responsible for:

Architecting new highly available storage systems and infrastructure, supporting a variety of workloads from compute customers

Automating complex workflows and new deployments with Bash/Python and Saltstack/Ansible, increasing the reliability of our storage platforms

Engaging and networking with Ceph developers and users, contributing back to the open-source community

Identifying bottlenecks within the OSI model, improving performance and reliability wherever possible in software and hardware

Tuning Ceph, the Linux kernel, and server hardware, maximizing performance for our customers

Testing and deploying new Ceph features, releases, and bug fixes; automating regression testing for components within our storage platform

Working closely with our hardware engineering teams to research, benchmark, and validate next-generation hardware builds

Do what you love

To be successful in this role you will:

Have 5 years of relevant experience and a Bachelor’s degree or its equivalent in work experience

Have professional experience in a Site Reliability, Development, or Systems Engineering role, preferably with large scale distributed systems such as Ceph

Be familiar with benchmarking tools like FIO, and concepts like IOPS, throughput, 99th percentile latency, and tail latency

Have professional experience benchmarking and tuning bare-metal servers for maximum performance

Have experience with automation tools such as Terraform, Ansible, Jenkins, or Salt Stack

Have experience troubleshooting Linux systems with tools like tcpdump, iostat, strace, iftop, netstat, and iotop

Have experience designing, deploying, and running mission-critical Linux servers at scale

Work in a way that works for you

FlexBase, Akamai’s Global Flexible Working Program, is based on the principles that are helping us create the best workplace in the world. When our colleagues said that flexible working was important to them, we listened. We also know flexible working is important to many of the incredible people considering joining Akamai. FlexBase gives 95% of employees the choice to work from their home, their office, or both (in the country advertised). This permanent workplace flexibility program is consistent and fair globally, to help us find incredible talent, virtually anywhere. We are happy to discuss working options for this role and encourage you to speak with your recruiter in more detail when you apply.

********

Site Reliability Engineer – Telecommute at UnitedHealth Group

Location: Los Angeles

Careers at Solutran, part of the Optum and UnitedHealth Group family of businesses. We create direct spending solutions driven by our extensive financial tech experience to help those we serve be healthier, happier and more productive. Our platform helps members manage their health plans, supplemental benefits and rewards all in one place. You’ll have the opportunity to make it easier for consumers to manage their own health by making healthier products more affordable and their purchases streamlined. If you are a driven individual that thrives in fast-paced environments, values diversity and wants meaningful work that impacts the lives of many, then this is the team for you. Being part of an organization that makes healthier living easier for others leads to your life’s best work.(sm)

Combine Fintech and Healthcare, two of the fastest-growing fields on the planet, with a culture of performance, collaboration, and opportunity and this is what you get – leading edge technology that is improving the lives of millions. Here, innovation is not about another gadget; it is about providing advanced, state-of-the-art payment solutions to help those we serve be healthier, happier, and more productive.

Solutran, a subsidiary of UnitedHealth Group, is a leading Fintech company committed to creating game-changing, customer-friendly solutions. Solutran’s success is grounded in S3®, our proprietary financial network, which is used by the nation’s largest health plans, employers, retailers, and government entities. As a Fintech industry leader, our Solutran S3® platform helps members manage their supplemental benefits, products and services discounts, and rewards all in one place – through a single card, app, and website.

Solutran has established a reputation for delivering modern, advanced customer experiences through best-in-class payment solutions used by millions.

As a Software Engineer, you’re expected and empowered to be your best, to grow and develop your skills. Be prepared to move quickly, to engage proactively, and to think strategically. As a Site Reliability Engineer, you will help deliver insights and automated solutions to help our systems display world-class reliability. This specific position is for someone who loves to dig in and improve production systems. You will help the team improve the deployment pipeline, triage and support our production systems as well as apply your programming skills to develop and create solutions which improve application monitoring and performance.

The Site Reliability Engineer will work on a team dedicated to improving our application service levels while providing a rich feature set, high availability, and performance for both internal and external customers. This position is part of a team that is the first line of support for ensuring our systems are up and running at their best. Work will be done in an environment where applications may rely on a blended infrastructure of both on-premise and cloud deployments.

You’ll enjoy the flexibility to telecommute* from anywhere within the U.S. as you take on some tough challenges.

Primary Responsibilities:
• Develop and improve the tools and services necessary to support the DevOps model and eliminate manual work through automation including implementing practices that support Agile and Continuous Integration/Continuous Delivery (CI/CD) principles
• Help improve customer experience and application performance through quantitative service monitoring, alerting, and the use of data and operational dashboards
• Help define and achieve the Service Level Objective (SLO) for our applications and services
• Create and build necessary tools or processes required to meet our SLOs
• Provide support for high priority incidents (P1 and P2) impacting production and disaster recovery situations to ensure system recovery within SLA
• Perform and participate in ITSM tasks without supervision, and provide direct feedback to improve ITSM processes
• Create documentation and diagrams outlining systems capabilities, processes, and environments in a manner that others can understand
• Work on team projects, as well as individually assigned projects
• Off-hours support as needed (e.g., deployments)
• Participate in an on-call rotation with other Solutran Technical Service staff
• Other duties as assigned

An individual in this position must be able to successfully perform the essential duties and responsibilities listed above. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions of this position.

Primary Platforms:
• CI/CD Pipeline development
• Build Tools – Jenkins / Octopus / LiquiBase / Azure DevOps / GitHub Actions
• .Net Programming
• Logging and Monitoring – Splunk / DataDog
• Oracle & PL/SQL
• Source Control – git / tfs / BitBucket

You’ll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.

Required Qualifications:
• Undergraduate degree or equivalent experience
• 3+ years of professional IT experience (system administration, software development, or a combination), with steadily increasing responsibilities
• Experience building highly available and reliable systems with C#, .Net and IIS
• Experience with one or more modern monitoring and log forwarding tools such as DataDog or Splunk
• Experience with CI/CD methodologies and enabling automation for deployments

Preferred Qualifications:
• Worked in a Site Reliability Engineering or DevOps role
• Clear understanding of security best practices and secure programming, including use of Veracode
• Use of Ticketing and Alerting Tools – Jira / FreshService / PagerDuty / Splunk On-call
• CI/CD pipeline development using GitHub Actions
• PowerShell and Python Scripting
• Performance testing using Blazemeter or similar tool
• Oracle Cloud or Terraform Experience
• Administration of IIS Web Servers including SSL Certificates and ciphers
• Excellent customer service and communication skills

To protect the health and safety of our workforce, patients and communities we serve, UnitedHealth Group and its affiliate companies require all employees to disclose COVID-19 vaccination status prior to beginning employment. In addition, some roles and locations require full COVID-19 vaccination, including boosters, as an essential job function. UnitedHealth Group adheres to all federal, state and local COVID-19 vaccination regulations as well as all client COVID-19 vaccination requirements and will obtain the necessary information from candidates prior to employment to ensure compliance. Candidates must be able to perform all essential job functions with or without reasonable accommodation. Failure to meet the vaccination requirement may result in rescission of an employment offer or termination of employment.

Careers with Optum. Here’s the idea. We built an entire organization around one giant objective; make health care work better for everyone. So when it comes to how we use the world’s large accumulation of health-related information, or guide health and lifestyle choices or manage pharmacy benefits for millions, our first goal is to leap beyond the status quo and uncover new ways to serve. Optum, part of the UnitedHealth Group family of businesses, brings together some of the greatest minds and most advanced ideas on where health care has to go in order to reach its fullest potential. For you, that means working on high performance teams against sophisticated challenges that matter. Optum, incredible ideas in one incredible company and a singular opportunity to do your life’s best work.(sm)
*All Telecommuters will be required to adhere to UnitedHealth Group’s Telecommuter Policy.

Colorado, Connecticut or Nevada Residents Only: The salary range for Colorado residents is $66,100 to $118,300. The salary range for Connecticut / Nevada residents is $72,800 to $129,900. Pay is based on several factors including but not limited to education, work experience, certifications, etc. In addition to your salary, UnitedHealth Group offers benefits such as a comprehensive benefits package, incentive and recognition programs, equity stock purchase and 401k contribution (all benefits are subject to eligibility requirements). No matter where or when you begin a career with UnitedHealth Group, you’ll find a far-reaching choice of benefits and incentives.

Diversity creates a healthier atmosphere: UnitedHealth Group is an Equal Employment Opportunity/Affirmative Action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, national origin, protected veteran status, disability status, sexual orientation, gender identity or expression, marital status, genetic information, or any other characteristic protected by law.

UnitedHealth Group is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.

********

Senior Site Reliability Engineer- Remote at HealthEquity, Inc.

Location: Los Angeles

We are looking for a passionate Senior Site Reliability Engineer to join our team in Draper, Utah. Our team is responsible for driving scalable architecture, minimizing risks, providing visibility across a multitude of environments, systems and applications while using lean principles at scale in a fast-paced environment. You’ll contribute to the design and documentation of systems, in collaboration with scrum teams, looking for opportunities to automate away waste. You’ll work with scrum teams to troubleshoot complicated systems and applications and will partake in an on-call rotation.

What you’ll be doing
• Work with teams to design and implement automated code deployment solutions
• Work with teams to design and implement automated environment provisioning and container solutions
• Work with teams to design and implement application monitoring and alerting solutions to get issues to the right people at the right time
• Work with teams to remediate issues that impact the health and performance of our production systems and infrastructure
• Work with teams to diagnose and isolate issues at all layers of the stack, whether it be code or infrastructure, during development and in production
• Manage build definitions and hardware in support of our Continuous Delivery policies and procedures

What you will need to be successful
• Bachelor’s degree in CS/Engineering or equivalent experience
• 8+ years of experience in a DevOps, SRE, or IT Operations position
• 2+ years of experience writing SQL queries and Stored Procedures
• 2+ years of experience developing in .NET and C#
• Demonstrated interpersonal skills and ability to collaborate with product owners and development teams
• Demonstrated ability to context switch while still delivering on commitments
• Ability to troubleshoot complex systems and environments
• Experience with CI/CD concepts and tooling
• Knowledge of full stack monitoring concepts and tooling from code to system resources
• Experience with containerization design concepts and tooling

Benefits and perks
• Medical, Dental, Vision
• 401(k) match
• Paid Maternity/Paternity leave
• Ongoing education
• Tuition Assistance
• Gym/Fitness Reimbursement
• Purple with Purpose (paid volunteer time off)
• HSA contribution and match
• On site Lunch and Learns
• Award winning Wellness Program
• Consumer Driven Healthcare (CDH) education

Why work for HealthEquity

HealthEquity has a vision that by 2030 we will make HSAs as wide-spread and popular as retirement accounts. We are passionate about providing a solution that allows American families to connect health and wealth and build health savings for life. Through our innovative technology and superior service delivery, our members gain valuable insights to better save and spend their healthcare dollars.

We firmly believe that our team members drive the success of this company. We hire passionate contributors who enjoy the thrill of pioneering their positions to their full potential. Join us and discover a work experience where the person is valued more than the position, and where our purple culture drives a remarkable experience.

Our advice to you

HealthEquity is fiercely focused on hiring passionate individuals to contribute to our purple culture. If you speak passion, excellence, service, ambition, and fun, we want to speak with you! We believe that your personality is as important as your experience and qualifications, so when we do have the opportunity to speak together, be authentic, be genuine, be you! Showcase your experience and your passion.

********

Senior Site Reliability Engineer at GoDaddy

Location: Los Angeles

Location

At GoDaddy the future of work looks different for each team. Some teams work in the office full-time; others have a hybrid arrangement (they work remotely some days and in the office some days) and some work entirely remotely.

This position may be a hybrid or fully remote position, as decided by your manager. If designated as hybrid, you’ll divide your time between working remotely from your home and an office location, so you should live within commuting distance. If designated as remote, you’ll be working remotely from your home and may occasionally visit a GoDaddy office to meet with your team for events or offsites. Your hiring manager can share more about this role’s hybrid or remote designation.

This position is not eligible to be performed in Alaska, Mississippi, North Dakota, or the Virgin Islands.

Join our team

As a Certificate Authority (CA), GoDaddy is one of a handful of companies that provides the trust that the internet is built upon. The GoDaddy CA currently issues certificates for websites, code executables, and even operating system drivers. This type of security is critical to all businesses as it provides a way to secure data and provide assurances of security to their customers. As the world pushes for more devices and services to be online based, providing a high standard for digital security becomes critical.

The PKI Site Reliability Engineering team is looking for an experienced system administrator to join their team. The focus of this team is supporting the PKI development group and their applications as well as designing, improving and maintaining infrastructure for the PKI environment.

What you’ll get to do
• Conduct performance analysis and monitoring of multiple applications tiers and environments
• Identify, assist in planning and implement solutions that continually improve performance
• Identify, assist in creating and maintain automation which allows for self-healing and incident resolution
• Code contributions in languages such as Python and Bash
• Alert response for performance related thresholds
• Address security vulnerabilities and production outages within strict SLAs
• On-call rotations and Incident call handling
• Contribute to configuration management
• Create, package, maintain, secure and containerize SRE applications
• Collaborate with internal customers, auditors, and other teams
• Perform CA-related maintenance and ceremonies

Your experience should include…
• Skilled with Bash and Python scripting
• Experience troubleshooting Linux server related issues
• Basic understanding of configuration management (e.g., Puppet, Chef, Salt, Ansible)
• Basic understanding of how TCP/IP and the network stack function
• Exhibits good communication skills (i.e., written, verbal, documentation)
• Heavy experience with *Nix systems (primary focus on RHEL/CentOS/Alma Linux)
• Basic understanding of SQL query syntax
• Be comfortable working within a strictly regulated environment and following set internal and external mandatory policies
• Exhibits critical thinking in approach and implementation of duties
• Is expected to be a self-starter and self-motivated

You may also have…
• Basic understanding of how a PKI hierarchy is set up
• Basic understanding of certificate validation processes (e.g., OCSP, CRL)
• Experience with a variety of virtualization/container technologies (Kubernetes, Docker, VMWare, OpenStack, AWS, KVM, Digital Ocean Droplets, GH Actions etc.)
• Experience with using HSMs (Hardware Security Modules)
• Experience with Algorithms, including RSA, AES, ECDSA
• Experience with FIPS (140, 186) compliance
• Experience with other scripting/programming languages (Go, Ruby, Python, Groovy, C, Node.js, Perl, etc.)
• Experience working in an agile environment
• 5+ years working in a large scale environment
• Experience troubleshooting production issues in a geo-distributed application or CDNs with a focus on data integrity
• Prior work using any of the following: ElasticSearch, Openstack, Jenkins, Icinga2, and Salt

We’ve got your back… Enjoy our many benefits (My Wallet), including paid time off, 401k, equity grants and parental leave. Join one of our employee resource groups (Culture). Continue to have a side hustle, if you have one (we love entrepreneurs, remember?). Most importantly, come as you are and make your own way.

About us… GoDaddy is empowering everyday entrepreneurs around the world by providing all of the help and tools to succeed online. GoDaddy is the place people come to name their idea, build a professional website, attract customers, sell their products and services, and manage their work. Our mission is to give our customers the tools, insights and the people to transform their ideas and personal initiative into success. To learn more about the company, visit About Us (https://aboutus.godaddy.net/about-us/overview/default.aspx).

GoDaddy is proud to be an equal opportunity employer. We will not discriminate against any applicant or employee on the basis of age, race, color, ethnicity, national origin, citizenship, religion, creed, sex, sexual orientation, gender, gender identity or expression (including against any individual that is transitioning, has transitioned, or is perceived to be transitioning), marital status or civil partnership/union status, physical or mental disability, medical condition, pregnancy, childbirth, genetic information, military and veteran status, or any other basis prohibited by applicable federal, state or local law. GoDaddy will consider for employment qualified applicants with criminal histories in a manner consistent with local and federal requirements.

If you need help completing an application for a position with GoDaddy, please reach out to our Recruiting Team at myrecruiter@godaddy.com.

GoDaddy doesn’t accept unsolicited resumes from recruiters or employment agencies.

********

Senior Site Reliability Engineer at Motion Recruitment

Location: Los Angeles

100% remote, full-time Senior SRE opportunity based out of Los Angeles. Join a team of top engineers to help coordinate and service half a billion monthly users on the company’s website and mobile app. This position is centered around one’s ability to manage high-traffic production deployments and AWS technology.

This position has upward trajectory within the company and will be in an environment that promotes fostering both professional and personal development. The ideal candidate will have strong experience with AWS technologies, Cloud deployments, CI/CD pipelines, container/orchestration and microservices-based architecture. This opportunity will offer a chance to work on a team of world class engineers where continuing to learn and develop on the job is encouraged. Along with a competitive salary and bonuses, this role offers adequate work/life balance.

Required Skills & Experience
• 12+ years experience
  • Commercial software development
  • Leadership and operations
• 5+ years experience
  • AWS technologies (EKS, CDK, RDS, ECS, Dynamo)
  • Kubernetes
• 3+ years experience
  • Java (developing and debugging highly-concurrent, high-throughput systems)
  • Computer Science (data structures, algorithms, Object Oriented Software Design)

Bonus Skills & Experience
• Hive / Spark / Snowflake / Hadoop
• Datacenter-to-cloud migrations
• RESTful APIs and GraphQL
• Terraform & Atlantis
• Apache Solr and/or ElasticSearch
• Kafka, ActiveMQ
• Node.js, ES6, front-end technologies
• Istio
Benefits
• Competitive Salary & Annual Bonus
• Stock grant
• Handsome PTO
• 401k Matching
• Medical & Dental
• Wellness & Fitness Benefit
• Personal Travel reimbursement

********

Site Reliability Engineer/SRE at Vings Technologies

Location: Los Angeles

Position: Site Reliability Engineer (SRE) For Application

Bachelor’s or master’s in computer science, with years of SRE-related work experience and proficiency in the below skills as indicated:
• Integrate, maintain and automate application operations as SRE – Experienced
• Production engineering (SLO/SLA, OnCall, Incident response, Monitor Performance – Tuning and Improvement ideas) – Experienced
• Linux application and performance troubleshooting skills and Automation scripting – Expert
• Python automation and API development – Expert
• Container-based applications operations management – Experienced
• Microservices architecture, CI/CD pipeline, and Canary deployments – Competent
• Kubernetes, MQs/Kafka and Databases – Competent
• Test and examine code and analyze results – Competent
• Adapt to client technologies and homegrown tooling – Required
• Handle OnCall responsibility and support a wide array of services and functional areas – Required
• Excellent analytical skills and communication skills and a great team player – Required

Additional nice to have skills
• Familiarity/Experience in programming using Go Lang
• MySQL and/or Redis expertise
• Experience with building Grafana dashboards
• Experience with working on Large scale global systems
• Customer facing experience

********

Site Reliability Engineer at Logic20/20 Inc.

Location: Los Angeles

Job Description

We’re expanding capabilities for a major any by building an API layer that enables third-party retailers like Best Buy, Walmart, and Apple to activate devices directly in-store. Our delivery team is well-versed in agile methodology, driving the very best practices in the industry. Providing high-value development with a focus on meeting and exceeding the client’s objectives, this project emphasizes stakeholder communication to effectively translate the mission into a technical solution.

As Site Reliability Engineer, you’ll leverage a diverse set of technical expertise, including DevOps tooling, troubleshooting and debugging skills, cloud computing, coding, and infrastructure engineering to provide outstanding customer experiences. You’ll work in highly collaborative teams, drawing on cross-group collaboration, communication, and relationship-building skills.

About the team

The Logic20/20 Digital Transformation team applies design thinking and next-gen technologies to solve our clients’ toughest business challenges. You’ll work side-by-side with architects, managers, and engineering consultants to gain a 360-degree perspective of the challenge, contributing your unique perspective to develop innovative solutions.

About you

You have hands-on experience creating and implementing monitoring and alerting mechanisms.

You’re an expert at improving CI/CD processes.

You have good communication skills, and you’re able to communicate technical requirements to business teams.

You’re an expert at backlog management and have the proven ability to identify production issue patterns and propose solutions to address problems.

You have extensive experience managing and maintaining security and access to applications.

********

Senior Site Reliability Engineer- Remote at HealthEquity, Inc.

Location: Los Angeles

We are looking for a passionate Senior Site Reliability Engineer to join our team in Draper, Utah. Our team is responsible for driving scalable architecture, minimizing risks, and providing visibility across a multitude of environments, systems, and applications while using lean principles at scale in a fast-paced environment. You’ll contribute to the design and documentation of systems, in collaboration with scrum teams, looking for opportunities to automate away waste. You’ll work with scrum teams to troubleshoot complicated systems and applications and will take part in an on-call rotation.

What you’ll be doing
• Work with teams to design and implement automated code deployment solutions
• Work with teams to design and implement automated environment provisioning and container solutions
• Work with teams to design and implement application monitoring and alerting solutions to get issues to the right people at the right time
• Work with teams to remediate issues that impact the health and performance of our production systems and infrastructure
• Work with teams to diagnose and isolate issues at all layers of the stack, whether it be code or infrastructure, during development and in production
• Manage build definitions and hardware in support of our Continuous Delivery policies and procedures

What you will need to be successful
• Bachelor’s degree in CS/Engineering or equivalent experience
• 8+ years of experience in a DevOps, SRE, or IT Operations position
• 2+ years of experience writing SQL queries and stored procedures
• 2+ years of experience developing in .NET and C#
• Demonstrated interpersonal skills and ability to collaborate with product owners and development teams
• Demonstrated ability to context switch while still delivering on commitments
• Ability to troubleshoot complex systems and environments
• Experience with CI/CD concepts and tooling
• Knowledge of full stack monitoring concepts and tooling from code to system resources
• Experience with containerization design concepts and tooling

Benefits & Perks
• Medical, Dental, Vision
• HSA contribution and match
• Dependent Care FSA match
• Unlimited Paid Time Off
• 401(k) match
• Paid Parental Leave
• Ongoing Education & Tuition Assistance
• Gym/Fitness Reimbursement
• Award Winning Wellness Program

Come be your authentic self

Why work for HealthEquity

HealthEquity has a vision that by 2030 we will make HSAs as widespread and popular as retirement accounts. We are passionate about providing a solution that allows American families to connect health and wealth. Join us and discover a work experience where the person is valued more than the position.

HealthEquity, Inc. is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, age, color, religion, sex, sexual orientation, gender identity, national origin, status as a qualified individual with a disability, veteran status, or other legally protected characteristics. HealthEquity is a drug-free workplace.

********

Software Engineer/Site Reliability Engineer at UnitedHealth Group

Location: Los Angeles

Position: Software Engineer / Site Reliability Engineer

UnitedHealth Group is a company that’s on the rise. We’re expanding in multiple directions, across borders and, most of all, in the way we think. Here, innovation isn’t about another gadget; it’s about transforming the health care industry. Ready to make a difference? Make yourself at home with us and start doing your life’s best work.(sm)

Optum Labs is the research and development arm of UnitedHealth Group. We’re a diverse team of curious thinkers and experts in big data, artificial intelligence and machine learning, and scientific and clinical research searching for new ways to help people live healthier lives. We partner with world leaders in health care delivery, research, and technology to create disruptive solutions that serve patients, caregivers, providers, and commercial and government payers.

You’ll enjoy the flexibility to telecommute from anywhere within the U.S. as you take on some tough challenges.

Primary Responsibilities:
• Actively participate in on-call resolution for the platform and/or dependent components that the product engineering teams rely on for their work. This will take no more than 25% of your time. The rest of the time will be spent automating and developing quality and operational improvements and solutions
• Maintain and improve operational tooling and frameworks, and perform chaos engineering activities
• Perform root cause analysis and deliver resolution for tools and automation failures
• Build frameworks that test the performance and resiliency of our platform services/tools
• Build/integrate/administer systems and tools that enable engineering teams to observe their applications in production with autonomy (Dashboards, APMs)
• Automate alerts for metrics on performance, cost, vulnerabilities, risk, compliance violations
• Identify and measure SLOs, SLAs and SLIs
• Improve processes/runbooks and champion automation of any manual items around support

You’ll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role, as well as provide development for other roles you may be interested in.

Required Qualifications:
• BA/BS in Computer Science, Engineering or related field or equivalent experience
• 3+ years developing cloud-native applications using one or more languages (TypeScript, C#, Java)
• 3+ years deploying and operating cloud-native applications in a public cloud (Azure preferred)
• 2+ years in a role supporting software and/or cloud infrastructure in an on-call capacity, with identification and remediation of technical problems
• In-depth and proactive communication skills around status of projects/issues in production
• Solid Git skills
• Full COVID-19 vaccination is an essential job function of this role. Candidates located in states that mandate COVID-19 booster doses must also comply with those state requirements. UnitedHealth Group will adhere to all federal, state and local regulations as well as all client requirements and will obtain necessary proof of vaccination, and boosters when applicable, prior to employment to ensure compliance.

Candidates must be able to perform all essential job functions with or without reasonable accommodation.

Preferred Qualifications:
• 3+ years implementing dashboards to help teams visualize logs, instrumentation, and other data to ensure optimal performance of the platform services, infrastructure, and deployed applications (Grafana preferred)
• Experience with Docker and Kubernetes (Azure Kubernetes Service preferred)
• Experience using centralized logging solutions (Splunk preferred, ELK, etc.)
• Experience using active monitoring systems (Datadog, New Relic, etc.)
• Experience creating runbooks, processes, and test plans around reliability, performance, etc. of infra/applications
• Experience planning and supporting 99.99%+ availability for critical applications in production

We Lead with Diversity, Inclusion and Compassion

At Optum Labs, we are dedicated to building teams where every individual is recognized for their unique experience and contributions. Our Leadership Principles underscore our commitment to inclusion, encouraging us to walk in each other’s shoes and open doors for our peers.

UnitedHealth Group supports local, regional, and national organizations that share these values through joint initiatives, event and program participation, volunteerism and giving. Through our Connected Communities, employees can connect with others who have similar, or different, life experiences and backgrounds. These groups are led by peers, supported by Human Capital and championed by leaders.

We Invest in Talent

Managers at every level are committed to their roles as talent stewards who help guide and nurture professional development. We want our employees to reach their highest level of potential just as they help us reach ours. Join Optum Labs and you’ll be part of a culture that prizes innovation and works with uncompromising integrity.

At Optum Labs, employees are our first customers. That’s why we offer virtual work environments to provide work/life flexibility via…

********
