Full-time Site Reliability Engineer openings in California on September 15, 2022

Senior Site Reliability Engineer (AWS) at NEAR

Location: Pasadena

Description

You will be joining Near, one of the fastest-growing enterprise SaaS companies, and will experience a true start-up culture with the freedom to experiment and innovate. At Near, we believe that great culture is not just about work; it’s work + life. We not only encourage our employees to dream big, but also give them the freedom and the tools to do so.

Near has an immediate need for a Sr. Site Reliability Engineer with experience supporting deployments and maintaining high-volume, high-performance, and high-availability websites, applications, and database infrastructures. This role is 100% remote, with an option to work in-office if preferred. The Sr. SRE will report directly to the Director, Site Reliability Engineering.

A Day in the Life:
• Provision and maintain current and new AWS infrastructure using Terraform
• Deploy and support CI/CD build pipelines via CodeBuild, CodePipeline, and Jenkins
• Ensure web application high availability and responsiveness
• Monitor applications, services, logs and databases
• Troubleshoot and remediate infrastructure incidents
• Create, maintain and enhance operational scripts and scheduled jobs
• Deploy IAM roles and permissions to support application workloads
• Document infrastructure procedures and contribute to shared knowledge repositories
• Communicate and collaborate with SRE, Engineering and beyond via Slack
• Provide on-call support after business hours, backed up by Engineering

What you bring to the role:

5+ years of experience as a systems engineer with a focus on:
• Terraform automation framework
• Linux systems administration (Ubuntu preferred)
• Supporting 24/7 production systems infrastructure in the AWS ecosystem
• AWS cloud network administration
• Bash shell scripting
• Source code repository management (GitHub preferred)

Deep, proven experience managing the following AWS services:
• Compute: EC2, Lambda, ECS/Fargate
• Network: VPC, Route 53, ALB and CloudFront
• Storage: S3, EFS
• Database: RDS, Redshift
• Monitoring: CloudWatch (all features)
• Compliance: Config, EventBridge
• Security: IAM, GuardDuty
• Ops: AWS CLI

About Near

Near is the world’s largest source of intelligence on people and places, processing data from over 1.6 billion monthly users across 44 countries. The Near Platform powers data-driven marketing and enrichment offerings through a suite of SaaS products. Users of the platform can leverage audience, spatial, and retail data, among others, in a privacy-led environment.

Founded in 2012, Near is headquartered in Singapore with offices in California, New York, London, Paris, Bangalore, Tokyo, and Sydney. Today, marquee brands such as News Corp work with Near to provide enhanced customer experiences.

Near is backed by leading investors including Sequoia Capital, JP Morgan Private Equity Group, Cisco Investments, Telstra Ventures, and Greater Pacific Capital.

********

Site Reliability Engineer at Jobot

Location: San Mateo

This Jobot Job is hosted by Marty Glatz

Are you a fit? Easy Apply now by clicking the “Apply” button and sending us your resume.

Salary $120,000 – $160,000 per year

A Bit About Us

We empower world-class and diverse creative and technical talent from all mediums to entertain global audiences through animated feature films with heartfelt storytelling, captivating worlds, and inspiring characters.

Key to our vision is developing a real-time enabled pipeline that brings intimacy back to the animation process while pushing the creative envelope. Our team has worked on many of the earliest and most successful 3D animated films, and have experienced firsthand the exponential increase in visual complexity and crew sizes as narrative appetite and audience expectations have grown. GPU accelerated rendering and game engines unlock new workflows that can upend long held assumptions underpinning the assembly line approach of legacy studios.

Why join us?

We empower world-class and diverse creative and technical talent from all mediums to produce animated franchises with global appeal and metaverse potential.

At the core of our vision are feature films with heartfelt storytelling, captivating worlds, and inspiring characters.

We are combining traditional, real-time, and cloud-based technologies to create a next-generation feature animation pipeline.

Job Details

The Site Reliability Engineer (SRE) will help support and deploy our next-generation, cloud-based content production pipeline for high-end animation. We are developing a work-from-anywhere approach that will depend on a rock solid systems foundation that integrates the latest cloud technology with our high-power on-premise hardware. The SRE will work with our technical and creative leads to help engineer a multi-regional platform for remote and in-person collaboration. Come and help lay the groundwork for what will become the future of the animation industry.

Responsibilities

Work in a cross-functional team to architect and engineer the most optimal cloud-native and on-prem CI/CD solutions for the studio.

Define workflows and processes to meet Infrastructure-as-Code (IaC) objectives.

Actively facilitate continuous improvement.

Engineer and manage Windows software lifecycle management IaC pipeline.

Architect and manage cloud and on-prem solutions supporting the studio asset-management pipeline.

Proactively monitor and troubleshoot site reliability issues.

Operate Spire’s cloud platform, provide service owner support, and participate in incident escalations.

Stay current with industry trends, making recommendations as needed to help the organization innovate and excel.

Requirements

Experience with Infrastructure-as-Code tools like Terraform and CloudFormation.

Experience with building VM Images using tools like Packer.

Experience with Config Management tools like Chef, Ansible etc.

Strong software engineering skills, preferably working in multiple programming languages (Go, Python, JavaScript).

Proficiency in cloud-native technologies and architectures (Docker, Kubernetes).

Proficiency in revision control and DevOps best practices (Git).

Expert Linux and Windows experience.

Demonstrable scripting experience with a variety of scripting languages for automating tasks, generating reports, and creating tools (e.g. Bash, Python, PowerShell).

Bachelor’s Degree in Computer Science or Engineering or equivalent experience.

Skilled at working in tandem with a team of engineers, or alone as required.

Excellent communication and organizational skills, and the ability to stay focused on completing tasks and meeting goals within a busy workspace.

Bonus

Experience operating Perforce.

Experience with Unreal Engine.

Experience operating high capacity GPU farms.

Media and Entertainment experience.

Passion for real-time rendering and animation.

Experience working with a globally distributed team.

Compensation

Competitive salary.

Go home (finish work) at a decent time to hang out with your family and friends.

Interested in hearing more? Easy Apply now by clicking the “Apply” button.

********

Sr. Site Reliability Engineer at Jobot

Location: San Francisco

100% Remote + Salary up to $190K + Work/Life Balance with Unlimited PTO + Tremendous Benefits Package!

This Jobot Job is hosted by Ryan Jauregui

Are you a fit? Easy Apply now by clicking the “Apply” button and sending us your resume.

Salary $150,000 – $180,000 per year

A Bit About Us

100% Remote Senior Site Reliability Engineer needed for an integration software company. We are a leader in our industry and are transforming how organizations transact business with each other.

Why join us?

We offer an excellent benefits package that has been designed to meet the needs of our diverse workforce. Our package provides employees a consistent, competitive level of benefits with the flexibility to support individual growth within the organization.
• Competitive Base Salary up to $180K, depending on experience.
• Unlimited PTO (Yes, that’s right. Unlimited)
• Medical, Dental, Vision Starting Day 1
• 401K with Company Match
• Continued Training Programs
• 100% Remote
• Full Work From Home set up provided

Job Details

As the Senior SRE, you will serve as a leader in planning, production, and engagement with software developers and infrastructure engineers to integrate software development and delivery.
• Continuous improvement of system and application monitoring and automation.
• Monitoring of infrastructure, systems, and application availability, performance, and capacity.
• Identify and automate manual workarounds and process improvements.
• Monitor the availability, latency, scalability, and efficiency of all services.
• Lead efforts for updating production with new versions/infrastructures as they are available
• Lead capacity planning efforts to determine changes to infrastructure that are needed to support new load and performance characteristics

Preferred Requirements
• BE/MCA in Computer Science or Engineering.
• 3 to 15 years of experience in Site Reliability Engineering.
• Knowledge of Amazon S3, EC2, RDS, EFS, ELB, Route 53 is needed.
• Experience in one or more of the following C, C++, Java, Python, Go, Ruby, Scala, NodeJS.
• Experience in Linux and Unix-like operating systems.
• Must be self-directed, flexible, and be able to prioritize and handle multiple projects simultaneously.
• Outstanding problem solving, troubleshooting, and decision-making skills.
• Knowledge of CloudFormation, CloudWatch, CodeDeploy, DynamoDB, Lambda, SQS is a Plus.

Interested in hearing more? Easy Apply now by clicking the “Apply” button.

********

Site Reliability Engineer at FalconX

Location: San Mateo

Who are we?

FalconX is one of the fastest-growing startups in FinTech. We are redefining prime brokerage from the ground up.

We are backed by some of the best investors in the world including Accel, American Express, B Capital, Coinbase, Fidelity, Lightspeed Venture Partners, Fenbushi Capital and Tiger Global Management + more yet to be publicly disclosed.

We deliver institutional digital asset traders best-in-class trading, credit, custody and structured products. We trade, lend and secure tens of billions of dollars monthly, are highly profitable, and growing fast, so we need your help!

We are data-driven. Whether it’s a growth or product decision, we believe data can always help us make more precise and informed choices.

We move fast. Speed of execution is essential for any startup, but we believe this is even more pertinent in our 24/7 industry.

We prioritize learning. Outcomes are mission-critical, but we also believe that learning in success and in failure will drive our continued success. Our industry is emergent – there’s no shortage of experiments to get involved with and to continue growing and learning together.

FalconX has offices in San Mateo, Chicago, New York, Bangalore, and Malta.

Who is on the team?

We are entrepreneurs. Many in our company have been founders or have aspirations to eventually start their own company. We take these ambitions and experiences to bring a solutions-oriented mindset to the problems we encounter day-to-day.

We are experienced. We have been fortunate to have learned from mentors and peers at institutions such as Google, LinkedIn, JUMP Trading, Citadel, PEAK6 Investments, Goldman Sachs, Harvard Business School, Carnegie Mellon, IIT + more.

Responsibilities
• Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement. Embed with Engineering teams to apply industry best practices.
• Build and manage systems, infrastructure and applications through automation
• Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
• Maintain services once they are live by measuring and monitoring availability, latency and overall system health
• Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
• Practice sustainable incident response and blameless postmortems
• Together with your engineering team, you will share an on-call rotation and be an escalation contact for service incidents

Minimum Qualification
• BS with 3+ years or MS with 1 year of working experience in Site Reliability Engineering (SRE)
• Experience with programming in at least one of the following languages: C, C++, Java, Python, or Go.
• Strong skills around observability, debugging and performance tuning, willing to dive into understanding, debugging, and improving any layer of the stack
• Experience with infrastructure such as AWS, GCP, Azure, MySQL, Kubernetes, Docker, etc.
• Deep knowledge of Linux internals
• Networking protocols such as TCP/IP

Preferred Qualification
• Extensive experience in supporting production Internet services
• Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
• Systematic problem-solving approach, coupled with effective communication skills and a sense of drive

********

Site Reliability Engineer at Instrumental Inc.

Location: None

We have announced our $50M Series C round! Here is an interview with our CEO Anna that was featured on TechCrunch. https://techcrunch.com/2022/02/16/with-a-50m-series-c-instrumental-looks-to-expand-data-driven-manufacturing-solution/

About Instrumental:

Manufacturing output represents half of the world’s GDP, but 20% of every dollar spent is wasted on scrap, rework, and mistakes. Beyond being wasteful, this is one of the main reasons new hardware products are late or fail to materialize. Instrumental accelerates how the world’s best brands bring new products to market by collecting unique data from assembly lines and feeding it to AI-powered software tools to find and fix manufacturing issues.

Instrumental has helped many household brands (e.g. Bose, Meraki, Honeywell) to significantly shorten innovation cycles and time to go-to-market. We’re growing at venture pace with significant revenue gains in both 2020 and 2021.

About the Role:

As an SRE, you’ll work closely with our engineering and product teams to help build and scale our fast growing infrastructure used by our customers to debug, optimize, and scale their mission critical manufacturing capacity. As a startup, we’re looking for broad generalists in AWS, Terraform, and Linux that have a deep sense of urgency while abiding by the motto “automate all things”.

Here are some examples of projects that you can expect to work on:
• Automate the fleet management of our hardware devices used on assembly lines.
• Stand up isolated single-tenant infrastructure to support customer compliance and scaling needs.
• Write scripts to automate telemetry, troubleshooting, and alerting.

We’d Love to Chat if You Have:
• Experience with Terraform preferred, although we will consider experience with Ansible or other declarative configuration management tools (Puppet, Chef, Salt, or CloudFormation)
• Prior experience using Bash or another shell scripting language
• Linux knowledge and troubleshooting experience
• A passion for understanding and solving problems
• Previous exposure to AWS or other cloud management experience
• Familiarity with Docker containers
• Great documentation and organizational skills
• Ability to communicate with different stakeholders across the company
• 5+ years of experience as a Systems Administrator or a DevOps Engineer
• Familiarity with network troubleshooting skills in cloud environments (basic protocols, routing)

Culture & Benefits:

We’re a collaborative team that actively works to promote an inclusive environment, valuing passion and learning. We are all highly energized by the opportunity for such a large impact. In addition, we value transparency, which extends even to our interview process. It’s designed to build an understanding of what it would be like to work together with empathy, not to put candidates on the spot.

To help Instrumentalists grow and thrive, we are also proud to offer competitive benefits.

All candidates must have an unrestricted right to work in the U.S.

********

Senior Site Reliability Engineer – Health Tech – Up to $300,000 + unlimited PTO + fully remote! at Hunter Bond

Location: Los Angeles

Title: Senior Site Reliability Engineer

Company: Very fast growing technology healthcare company

Compensation: Up to $300,000 + very good benefits

Location: Remote

Experience in: Python, Golang, Kubernetes, Terraform, AWS, Linux

THE COMPANY

This is a very successful and fast-growing healthcare company. They place a huge emphasis on technology in their business and need very passionate engineers to help scale the company and solve interesting challenges within the HealthTech world.

They’re able to offer fully remote positions from 22 US states and extremely rewarding opportunities.

REQUIREMENTS
• 3+ years of experience in software engineering, DevOps or a similar SRE position
• Strong proficiency in Python programming or Golang
• Strong experience with Kubernetes, Terraform
• Experience with Linux administration
• Experience with AWS infrastructure
• Experience with bare metal infrastructure would be a plus
• A genuine passion for technology!

PERKS
• Able to work fully remote or from the awesome New York office
• The ability to solve very interesting challenges
• Technologists are in charge and report to other technologists
• Work with very talented engineers from MAMAA and finance
• Unlimited PTO
• 401k matching
• Healthcare, dental, vision, life insurance
• Performance based bonus
• Constant need for learning, self development and progression
• Internal hackerrank events to promote engineer ideas
• A tonne of free food in the office

If interested then please apply with your updated resume ASAP as the client is actively hiring.

********

Sr. Director, Site Reliability Engineering at Under Armour, Inc.

Location: Sacramento

Sr. Director, Site Reliability Engineering

Date: Sep 14, 2022

Location: Remote, US

Company: Under Armour

Under Armour has one mission: to make you better. We have a commitment to innovation that lies at the heart of everything we do, not just for our athletes but also for our teammates. As a global organization, our teams around the world push boundaries and think beyond what is expected. Together our teammates are unified by our values and are grounded in our vision to inspire you with performance solutions you never knew you needed but can’t imagine living without.

Position Summary

This is a key leadership position that will lead three key functions for global technology: 1. SRE (Site Reliability Engineering), 2. PSR (Performance, Scalability and Reliability), and 3. Support (key L1 and L2 support for all infrastructure supporting all towers in the global technology team and all key engineering systems). The role is crucial to building two important functions that are a critical part of any high-performing engineering system. The role will build the SRE team to create productivity tools for engineering teams to increase agility (including but not limited to automation and the DevOps toolchain). These tools will jumpstart ecommerce acceleration. The team will also build critical tools to increase the reliability of critical systems and services (e.g. proactive monitoring). The role will build the PSR team to focus on systems uptime and scalability. The role will drive the Holiday readiness program at Under Armour (not limited to holidays but year-round), automation of performance tests (as part of CI/CD), and DR initiatives for Tier 1 systems. Both these functions are new and need a mind shift and culture change in how we do engineering at Under Armour. This role will transform the support functions. The role will define operational KPIs that are relevant to the ecommerce and digital business, build and track dashboards, and hold teams accountable for meeting the KPIs.

Essential Duties & Responsibilities
• Develops and defines objectives, methodologies, and metrics for all Under Armour SRE, PSR and support initiatives.
• Define strategy and roadmaps and oversee the development of long-term plans and proposals to achieve business objectives.
• Partners with the enterprise and software architecture function and program management function to ensure the SRE strategy and roadmaps align with the technology roadmaps for our corporate & teammate applications, product & supply chain, retail, ecommerce, martech and connected fitness technology towers.
• Partners with other technology leaders to define an IT operating model that takes advantage of digital innovation across the enterprise.
• Defines CICD pipeline, automation and other productivity tools to ensure Engineers are agile and productive, systems are reliable and scalable
• Define KPIs for support (for infrastructure of all towers and all engineering cloud and on prem systems) to track system heartbeat and customer experience for all retail, warehouse and corporate functions.
• Build the SRE and PSR teams and change the mindset and culture of Engineering operations to increase agility and reduce cost.
• Change the mindset from being reactive to production issues to being proactive, so that production system issues are resolved before any impact to customer experience
• Ensures continuous alignment of project investments and initiatives with business strategy based on changing functional needs, resource capacity constraints, risk exposure, and interdependencies
• Manages vendor relationships to ensure continuous innovation & support.
• Manages relationship with managed service providers and ensure adherence to SLAs and overall quality.
• Establishes paths to customer value for our licensed technology and looks for opportunities to sunset or replace underperforming solutions.
• Optimize IT governance, priorities and decision-making processes as the business context and technology landscape changes in partnership with stakeholders
• Develops and defines KPIs, objectives and targets for measuring and improving the quality, security, scalability, maintainability, cost and time to market of our existing and new infrastructure and engineering services.
• Owns the regular reviews of systems performance metrics with the IT leadership team and business stakeholders
• Accountable for a high-performing global infrastructure & engineering operation. This includes L1 and L2 support, Holiday readiness and disaster recovery drills, etc.
• Drive standardization and centralization of technologies across the enterprise to capture long-term cost savings and operational improvements
• Implement and maintain controls and monitoring procedures to ensure availability of critical systems, and minimal service interruptions.
• Develops the next generation of technology leaders who are able to build strategic partnerships with internal and external stakeholders to move the business towards digitally enabled growth.
• Develop and lead a high performing SRE, PSR and operations team to ensure the reliable delivery of IT services and operations.
• Drive a collaborative culture that values technical depth, accountability, and customer service.
• Partners with HR to hire, maintain and develop top talent and establish competitive infrastructure career paths.
• Responsible for building a diverse and inclusive organization with a strong culture of accountability, collaboration, innovation and brand affinity

Qualifications (Knowledge, Skills & Abilities)
• Highly knowledgeable of emerging technology and business trends and develops and executes an Engineering operations strategy that takes advantage of these trends, and collaborates with other business leaders to embed digital opportunities in business strategy
• Deep understanding of the key financial drivers and dynamics related to growth and revenue goals.
• Excellent presentation and communication skills, able to convey progress, risks and opportunities across the enterprise including the C-Suite and BOD
• Excellent interpersonal and communication skills and proven ability to work effectively with all organizational levels on a global scale
• Personnel development & management of a team
• Ability to meet multiple deadlines and manage multiple ongoing projects
• Ability to manage large IT budgets
• Strong oral and written communication skills
• Strong initiative, problem solving skills, and decision-making skills
• Deep functional knowledge of Holiday readiness processes in a known retail (or equivalent) business is required.
• Proven track record of defining KPIs, establishing dashboards, and bringing cross-functional awareness and buy-in to support those KPIs
• Experience with implementations or major cross-functional projects is required, in either an implementation or a business owner role.
• Creates a culture within and across teams that promotes proactive identification of gaps and pursues opportunities to improve functional capabilities through system or process improvements.
• Provides development opportunities for direct reports to expand skills and influence. Builds relationships with immediate team and across other IT and other business teams to drive cross-functional synergies

Education And / Or Experience
• Bachelor’s Degree in Information System Management, Computer Science, or a relevant business area preferred, but relevant professional experience will be considered. Master’s in Business Administration or Computer Science degree a plus.
• Minimum 15 years of experience in technology leadership positions with an emphasis on operational services. Demonstrated knowledge of current and emerging technologies and the ability to apply those technologies to business needs.
• Minimum 10 years of experience in supporting highly scalable and available systems in Direct to consumer or retail business.
• Minimum 10 years of experience leading teams
• Minimum 5 years of experience with vendor management

Other Requirements

Location: Home Office

Return To Work Designation: Fully Remote

Travel: 10 – 15% as required

Licenses/Certifications: ITIL Foundations, Agile training, PMP

Relocation
• No Relocation Provided

At Under Armour, we are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants and teammates without regard to race, color, religion, sex, pregnancy (including childbirth, lactation and related medical conditions), national origin, age, physical and mental disability, marital status, sexual orientation, gender identity, gender expression, genetic information (including characteristics and testing), military and veteran status, and any other characteristic protected by applicable law. Under Armour believes that diversity and inclusion among our teammates is critical to our success as a global company, and we seek to recruit, develop and retain the most talented people from a diverse candidate pool.

Learn more about Under Armour’s COVID-19 response and Teammate vaccination policies here.

********

Senior Site Reliability Engineer at Carta

Location: San Francisco

The Company You’ll Join

Carta is a platform that helps people manage equity, build businesses, and invest in the companies of tomorrow. Our mission is to unlock the power of equity ownership for more people in more places.

Carta is trusted by more than 30,000 companies and over half a million employees in nearly 150 countries to manage cap tables, compensation, and valuations. Carta also supports over 5,000 funds representing over $100B in assets under administration with their venture capital solutions. Carta’s liquidity solutions have returned $13B to shareholders in secondary transactions. Today Carta’s platform manages over two trillion dollars in equity for nearly two million people globally. Companies and funds like Canva, Tribe, and Pipe build their businesses on Carta.

The company has been included on the Forbes World’s Best Cloud Companies, Fast Company’s Most Innovative list, and Inc.’s Fastest-Growing Private Companies. For more information, visit carta.com.

The Team You’ll Work With

The Site Reliability Engineering team (SRE) at Carta is responsible for ensuring the availability, reliability, and resiliency of the Carta app and other production systems in various environments. The team has expertise in systems architecture and design, infrastructure automation using Terraform, AWS and Kubernetes. In addition, the SRE team collaborates closely with the Information Security team on defining secure network boundaries and implementing security policies.

The Problems You’ll Solve
• Develop and maintain Terraform configs, Jenkins pipelines, Kubernetes manifest files as infrastructure as code (IaC) and extend these configurations to support new services, features and multiple environments.
• Solve complex dependencies of critical services of various business units and build automation to prevent future problems. Develop automation scripts to streamline system upgrades and pipelines to improve deployment cycle.
• Maximize and maintain high availability of systems and services while ensuring critical business functions are meeting their SLOs.
• Influence new designs and architecture, best practices and standards in supporting and improving technology platforms.
• Establish monitoring and alerting of production systems and critical applications.
• Participate in our on-call rotation to resolve site incidents and document your findings into repeatable runbooks as part of improving site availability.
• Work cross functionally with a passion to improve developer productivity.

About You

We’re optimizing for strong senior engineers with at least 4 years of relevant experience who are excited about the opportunity to work with a fast-moving team. You will be part of a cross-functional team of engineers and product managers, and successful candidates will have extremely high EQ and IQ, with a strong bias towards collaboration. You should also have previous experience working with:
• Hosting distributed systems on public cloud providers (GCP or AWS)
• Containerization technologies (specifically, Docker, Kubernetes, Helm)
• Building and working with scalable infrastructures using Linux and Docker containers
• Automation via “infrastructure as code” (using tools like Terraform, Ansible, etc.) and writing scripts in Python and Bash
• GitHub and advanced understanding of CI/CD tooling (Jenkins, CircleCI)
• Production systems monitoring using tools such as Datadog, Grafana etc.

You’ll build reliable infrastructure via code for the Carta app to run on Kubernetes serving sensitive financial data. You will provide performance metrics visibility into the systems and applications via Datadog monitoring. You will leverage your prior experience in designing, building and maintaining infrastructure with reliability as a core principle to reduce service failures as they pertain to site performance and availability. You will lead by example to demonstrate team collaboration in timely execution of planned projects, enabling swifter delivery of software. You are pragmatic in making tradeoffs between different designs to optimize overall business value and are passionate about elevating the team by sharing knowledge and teaching. You have a desire to understand and solve people’s problems instead of simply fulfilling requests.

We are an equal opportunity employer and are committed to providing a positive interview experience for every candidate. If accommodations due to a disability or medical condition are needed, connect with us via email at recruiting@carta.com. As a company, we value fairness, helpfulness, transparency, and leadership, and build our teams around these values. Check out our careers page to get to know us better as you think about your next step at Carta.

********

Site Reliability Engineer at SonicJobs

Location: Santa Rosa

Looking for a Site Reliability Engineer with 8+ years of experience.
Strong experience in Prometheus, Grafana, Splunk.
Experience in cloud-based technologies and tools in configuration management, deployment, monitoring and operations.
Experience in Docker, Kubernetes, CloudWatch, etc.
Strong experience in CI/CD tools.
Bachelor’s degree.

********

Senior Site Reliability Engineer – Health Tech – Up to $300,000 + unlimited PTO + fully remote! at Hunter Bond

Location: Irvine

Title: Senior Site Reliability Engineer

Company: Very fast growing technology healthcare company

Compensation: Up to $300,000 + very good benefits

Location: Remote

Experience in: Python, Golang, Kubernetes, Terraform, AWS, Linux

THE COMPANY

This is a very successful and fast-growing healthcare company. They place a huge emphasis on technology in their business and need very passionate engineers to help scale the company and solve interesting challenges within the HealthTech world.

They’re able to offer fully remote positions from 22 US states and extremely rewarding opportunities.

REQUIREMENTS
• 3+ years of experience in software engineering, DevOps or a similar SRE position
• Strong proficiency in Python programming or Golang
• Strong experience with Kubernetes, Terraform
• Experience with Linux administration
• Experience with AWS infrastructure
• Experience with bare metal infrastructure would be a plus
• A genuine passion for technology!

PERKS
• Able to work fully remote or from the awesome New York office
• The ability to solve very interesting challenges
• Technologists are in charge and report to other technologists
• Work with very talented engineers from MAMAA and finance
• Unlimited PTO
• 401k matching
• Healthcare, dental, vision, life insurance
• Performance based bonus
• Constant need for learning, self development and progression
• Internal hackerrank events to promote engineer ideas
• A tonne of free food in the office

If interested then please apply with your updated resume ASAP as the client is actively hiring.

********
