Fulltime Site Reliability Engineer openings in New York, United States on September 19, 2022

Site Reliability Engineer at Hired

Location: New York

DevOps/Site Reliability Engineer/Cloud Engineer/Systems Engineer is gaining popularity in companies across all sectors, with DevOps Engineers/SRE leading the charge when it comes to putting best practices to use. They’re seen as the bridge between software and systems engineering to ensure products are released on time and within budget. DevOps Engineers implement and manage build infrastructure, build strategy (including container builds), and continuous delivery – while managing application configurations. They close the gap between the developer and the operations teams, wielding an arsenal of tech tools and experience to make the day-to-day operations on a company’s tech team as smooth and efficient as possible.

Having a hand in software releases that are quick, secure and of the highest quality is the goal of the DevOps Engineer/SRE and reflects well on their company.

We are looking for DevOps Engineer/SRE with the following types of experiences:
Responsibilities
• Designing and implementing solutions that make software deployments well-organized and automated.
• Coming up with effective results that bring developer & operations roles and goals together, and fostering a collaborative environment among all.
• Ensure product releases are high-quality and secure.
• Help automate processes to ensure efficiency, quality, and scalability.
• Liaise with various tech team members so product development and deployment are at their best.
• Having a firm understanding of business and client needs.
• Design, build, and maintain Continuous Improvement/Continuous Development (CI/CD), testing, and operations infrastructure.
Nice to Haves
• Experience with Gitlab-CI, Terraform, OpenStack Heat, Cloud Formation, Kubernetes/Docker, Azure, AWS, GCP, Kafka/Storm, Python, Go, Java, Spinnaker, Elasticsearch, Prometheus, APIGEE, HAProxy, NGINX, Cassandra, Zabbix, and/or other tools.
• Familiarity with infrastructure automation tools such as Puppet, Chef, Salt, Ansible, Jenkins, GoCD, Terraform, Artifactory, Nexus, InSpec, etc.
• Linux & Microsoft System Administration.
• Strong leadership and communication skills.
• BS or MS in Computer Science, Software Engineering or a related field, or equivalent work experience.
• 3+ years experience as a Software Engineer developing and maintaining an application.
• 3+ years experience with Linux administration (Full Stack or DevOps experience counts).
Apply Here
For Remote Site Reliability Engineer roles, visit Remote Site Reliability Engineer Roles

********

Tapad – Senior Site Reliability Engineer (Cloud) at Experian

Location: New York

Company Description

COVID-19 UPDATE: The NYC office has recently opened for local employees who meet vaccination requirements. All interviews will remain virtual. Position is open to remote employees in the US and Canada able to work hours of 9-5 EST.

Founded in 2010, Tapad cracked the code on cross-device marketing technology. Our groundbreaking, proprietary technology assimilates trillions of data points to find the relationship between smartphones, desktops, laptops, tablets, and connected TVs. Ten years later, we are processing data at petabyte scale, with an engineering team that comprises roughly half of our entire organization. When you work with us, you matter, and your work matters.

Job Description

Small Teams; Big Data

At Tapad, we look for individuals who are motivated by complex and challenging work. We want to work with people who share compelling solutions to those challenges, solutions informed by their unique experiences, passions, and expertise.

Tapad’s Infrastructure team is looking to add a Senior Site Reliability Engineer (SRE) to bring Infrastructure-as-code (IaC) to our SDLC. As a Senior SRE, you are responsible for the overall availability, efficiency, performance, and resilience of Tapad’s software systems. You will divide your time between system operations duties and developing software and tools that help increase system reliability and performance. You will collaborate with software engineering teams and use similar technologies related to their software deliverable’s design, deployment, and continued operations.

We ask our employees to make an impact, and feel it is only right to give a lot in return. We offer every employee a 401k with matching, generous maternity/parental leave, and PTO. We believe if you’re sick, feel like you’re getting sick, or just need a personal day, you should take that time to get better. We have free virtual lunches every month, free continuous education, and an open door policy every day. We make sure our virtual office is a welcoming space, full of individuals who can teach and learn from one another everyday.

A day in the life as a Senior Site Reliability Engineer:
• Apply the principles of software engineering and engineering best practices to system operations and administration tasks
• Design and implement software tools to increase reliability, performance and operations of Tapad’s commercial software products and systems
• Lead the configuration, testing, security, and deployment of project work
• Contribute to libraries, data structures, and frameworks developed by other engineers and, occasionally, to open source projects
• Instrument, monitor, and anticipate the runtime characteristics of the wider department’s software systems
• Participate in your team’s support rotation
• Maintain the SDLC quality bar by supporting the usage and adoption of DevOps abstractions that your team builds and provides to the wider engineering organization
• Liaise with program and project managers for cross-project and organization-wide project tracking
• Act as a ‘team lead’, including interviewing, mentoring and training teammates in Tapad’s technology stack, and in Site Reliability and DevOps best practices
• Contribute to the design and implementation of abstraction layers atop the offerings provided by SREs that facilitate and enable the wider engineering organization to programmatically provision, configure, and manage infrastructure resources

In This Role, You Would Be Using
• Google Cloud Platform
• Terraform
• Github Actions, Harness, Nexus
• Kubernetes and Helm
• Airflow, Cloud Composer, GKE
• BigQuery, CloudSQL, Pub/Sub
• Kubeflow Pipelines, Cloud Run, Cloud Functions
• Google Cloud Logging and Monitoring (Stackdriver), Looker
• Python, Golang, Scala

Qualifications

We are looking for candidates who meet some of the following qualifications:
• 5+ years in a cloud-based infrastructure role with development and automation experience
• 5+ years in a DevOps role with scripting/automation experience
• 2+ years of experience using Terraform in a cloud environment
• Shell, Python scripting abilities, familiarity with Golang
• Experience with other IaC tools including AWS CloudFormation, Red Hat Ansible, Chef, Puppet or SaltStack
• CI/CD workflows and SDLC tools (ex. Github actions, Jenkins, Harness)
• Cloud architecture (network, storage, compute, messaging, etc.)
• Passionate about best practices and reliable, sustainable, scalable environments

Bonus Experience
• Exposure to GCP ecosystem
• Knowledge of JVM languages such as Java/Scala/JVM0535
• Terraform
• Airflow
• Kubernetes
• Helm charts
• Go / Python

Tapad Perks
• Generous PTO, sick time off, and paid Volunteer Time Off (VTO)
• 401k matching, Life, LTD & STD Insurance, dental, vision, and telehealth plan with 24/7 access to a dedicated team of physical and mental healthcare providers
• Scala School (we’ll teach you!), LinkedIn Learning, peer-led professional development, continuous education stipend, and an abundance of resources to help you stay sharp
• Unlimited snacks and beverages for local staff in-office, collaboration lunches (virtual lunches for remote teammates)
• Discounts on gym memberships and wellness programs
• Foosball, ping pong, diversity and inclusion group, book club, virtual game nights and happy hours, and tons of other extra-curricular activities that will make you feel like part of the Tapad family
• Check out our #TapadLife page to see what our employees have to say
• Find more about our engineering culture HERE

Additional Information

Experian Marketing Service’s mission is to accelerate client success through enabling ecosystems, partnerships, and technology solutions. We help brands put people at the heart of their business and have meaningful interactions with their customers.

Founded in 2010, Tapad cracked the code on cross-device marketing technology, creating not only the first but the most robust global cross-device digital identity graph on the market. Ten years later, The Tapad Graph enables marketers to maximize their digital marketing investment for years to come. In November 2020, Experian acquired Tapad, which furthers Experian’s strong commitment to digital identity, activation and connected TV. As a leading provider of consumer data analytics and targeting solutions, Experian Marketing Services has a rapidly growing need to continue expanding our strategy on identity data and services to meet market demand.

All your information will be kept confidential according to EEO guidelines.

Experian is proud to be an Equal Opportunity and Affirmative Action employer. Our goal is to create a thriving, inclusive and diverse team where people love their work and love working together. We believe that diversity, equity and inclusion is essential to our purpose of creating a better tomorrow. We value the uniqueness of every individual and want you to bring your whole, authentic self to work. For us, this is The Power of YOU and and it reflects what we believe. See our DEI work in action!

If you live in Colorado, Connecticut or New York City, please contact us here for the salary range of this position (include this Job Title in your email). In addition to a competitive base salary and variable pay opportunity, Experian offers a comprehensive benefits package including health, life and disability insurance, generous paid time off including parental and family care leave, an employee stock purchase plan and a 401(k) plan with a company match.

Experian Careers – Creating a better tomorrow together

Find out what its like to work for Experian by clicking here
Apply Here
For Remote Tapad – Senior Site Reliability Engineer (Cloud) roles, visit Remote Tapad – Senior Site Reliability Engineer (Cloud) Roles

********

Sr. Site Reliability Engineer – Remote at Jobot

Location: New York

Problem solvers, lifelong learners, curious minds and tech obsessed engineers needed! Come join us as we grow our world class NaaS (Networking as a Service) & Cloud Connectivity service and enjoy great culture, comp and benefits!

This Jobot Job is hosted by Brendan Thomas

Are you a fit? Easy Apply now by clicking the “Apply” button and sending us your resume.

Salary $175,000 – $220,000 per year

A Bit About Us

We’re an award-winning global NaaS & Cloud Storage Provider that offers the most innovative tools in the industry.

Our customers enjoy faster integration, lower fees and full control of where their data is stored + much more.

Want to change the future with us? Apply now!

Why join us?
• World class team of innovators, lifelong learners, and problem solvers
• Strong compensation up to $200k annually + Bonus + Stock Options
• Fully remote
• Health, dental, vision, and life insurance benefits
• 401k
• Unlimited PTO
• Paid Maternity/Paternity leave and more!

Job Details

YOUR CONTRIBUTION
• Work with Devs to troubleshoot issues and provide systems level and architecture support
• Expand configuration management systems with innovative features
• Support Devs to bring new software and services to the relevant device
• Solve complex system stability issues
• Recommend new technologies that would strengthen our development and systems
• Automate Everything!

TECHNOLOGY
• Python
• Linux
• HTTP / TLS / HTTP/2
• ZFS, XFS, GPFS
• Cassandra
• RabbitMQ
• Kafka
• Ansible
• Salt
• Terraform
• Chef
• Puppet
• Github
• Ubuntu
• Jenkins
• Docker
• Kubernetes
• Nginx
• Postgres
• SSL
• PXE

Interested in hearing more? Easy Apply now by clicking the “Apply” button.
Apply Here
For Remote Sr. Site Reliability Engineer – Remote roles, visit Remote Sr. Site Reliability Engineer – Remote Roles

********

REMOTE – Senior Site Reliability Engineer at CyberCoders

Location: New York

Title: Senior Site Reliability Engineer

Location: Remote

Pay: $180k – $225k

Experience Requirement: 5+ years

We’re one of the world’s largest and most reputable business consulting companies, working with enterprise companies and industry leaders. This group brings together the best of our cloud capabilities to help clients use technology to transform their businesses. Due to a recent acquisition we are building out our development team to continue driving end-to-end value delivery at a lower cost and with higher higher ROIs to our clients. We’re looking for a qualified Cloud Architect to run the show!

What You Will Be Doing
• Develop and implement capabilities, which that serve clients and improve developer experience
• Create and maintain custom solutions, that help clients boost productivity and make timely and effective decisions
• Deliver solutions/expert knowledge in SCM and CI/CD tooling and practices for container workloads
• Master multiple programming / IaC languages
• Build SLA dashboards
What You Need for this Position
• 5+ years DevOps / SRE
• Bachelor’s/Masters Degree in Technology
• Experience /expertise with these languages is a plus (Python, Golang, Ruby, JavaScript, or other object oriented language)
• Cloud: AWS / GCP
• Containerization: Docker / Kubernetes
• Terraform
What’s In It for You
• Vacation/PTO
• Medical
• Dental
• Vision
• 401k
• Bonus
Benefits
• Vacation/PTO
• Medical
• Dental
• Vision
• 401k
• Bonus
So, if you are a REMOTE – Senior Site Reliability Engineer with experience, please apply today!

Colorado employees will receive paid sick leave. For additional information about available benefits, please contact Casey Glad

Email Your Resume In Word To

Looking forward to receiving your resume through our website and going over the position with you. Clicking apply is the best way to apply, but you may also:

Casey.Glad@
• Please do NOT change the email subject line in any way. You must keep the JobID: linkedin : CG12-1705046 — in the email subject line for your application to be considered.***
Casey Glad – Executive Recruiter – CyberCoders

Applicants must be authorized to work in the U.S.

CyberCoders, Inc is proud to be an Equal Opportunity Employer

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status, or any other characteristic protected by law.

Your Right to Work – In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

CyberCoders will consider for Employment in the City of Los Angeles qualified Applicants with Criminal Histories in a manner consistent with the requirements of the Los Angeles Fair Chance Initiative for Hiring (Ban the Box) Ordinance.
Apply Here
For Remote REMOTE – Senior Site Reliability Engineer roles, visit Remote REMOTE – Senior Site Reliability Engineer Roles

********

Senior Site Reliability Engineer (SRE) at Instabase

Location: New York

At Instabase, we’re passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest, and most complex institutions in the world, and investors like Greylock, Andressen Horowitz, and Index Ventures, our market opportunity is undeniable.

Instabase is a remote company rooted in flexibility. Employees can choose to work from one of our global offices in Menlo Park, New York, London, or Bangalore, fully remotely, or a mix of the two. At the center of our value proposition is our people, and we’ve built a fearlessly experimental, endlessly curious, customer focused team who together, are fundamentally changing how developers build and distribute intelligent business applications.

Instabase is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Research shows that in order to apply for a job, women feel they need to meet 100% of the criteria while men usually apply after meeting about 60%. Regardless of how you identify, if you believe you can do the job and are a good match, we encourage you to apply.

Come help us build for the next stage of growth and scale — accelerate your career with Instabase!

Our Site Reliability Engineering team combines the Software Engineering & Systems Engineering to build scalable, distributed, fault-tolerant systems. The team keeps a watchful eye on the System Performance, Capacity and Failure modes to ensure high availability, and ability to grow.

What you’ll do:
• Set the technical direction for your team and work with multifunctional partners to deliver on your work
• Develop and execute against both short and long-term roadmaps, making effective tradeoffs between business impact, user experience, and a high-quality technical foundation
• Write code as we expect our technical leadership to be in the trenches alongside junior engineers, understanding root causes and leading by example
• Improve the team and company – you will be an active participant in our culture (mentorship, interviewing, and new initiatives)

About you:
• BS (or higher, e.g., MS, or Ph.D.) in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience
• 5+ years of professional experience working in Production Engineering, Site Reliability Engineering (SRE), DevOps, or equivalent positions
• Proficiency in programming languages or scripting languages
• Experience working with Infrastructure as Code / Automation tools (Ansible, Terraform)
• Experience with container orchestration systems
• Proven track record of technical leadership
• Strong knowledge of shipping impactful and complex software projects
• Ability to set technical and cultural standards for engineers
Apply Here
For Remote Senior Site Reliability Engineer (SRE) roles, visit Remote Senior Site Reliability Engineer (SRE) Roles

********

Sr. Site Reliability Engineer at Khayainfotech

Location: New York

Job Title: Sr. Site Reliability Engineer / Monitoring

Duration: Long Term Contract.

Location: New York(Need to on-site once a month)

Site Reliability Engineer good with Splunk and Dynatrace to set up Alerts dashboards and tune Splunk queries

Must Have :
• Root Cause Analysis
• Dynatrace
• Splunk
• Prod Support

As part of Platform Services Program, we are seeking a highly motivated, detail-oriented Senior Software Engineer-Site Reliability Engineer.

The position requires you to build and support API Gateway that provides API traffic routing for entire enterprise. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant API systems. SRE ensures that Client’s API services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to users”” needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

This is a high visibility team that requires you to build scalable, resilient, fault tolerant and secure features onto API Gateway ensuring we meet our availability and performance SLAs.

Responsibilities
• Engage in and improve the whole lifecycle of services—from inception and design, deployment, operation, and refinement.
• Provide guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
• Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
• Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
• Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity.
• Practice sustainable incident response and blameless postmortems.
• Adhere to Client security standards, change management and quality controls, enabling automations where required.

Minimum Qualifications
• Experience working in Computer Science (e.g. networking, distributed systems, infrastructure, cloud)
• Experience with Unix/Linux operating systems internals, administration and networking
• Experience designing, building, and maintaining production services, and experience analyzing and troubleshooting systems
• Experience in building REST, gRPC and SOAP APIs
• Understanding of API authentication mechanisms such as oAuth, mTLS and JWT, SAML
• Understanding of PKI, certificate management lifecycle, symmetric and asymmetric encryptions.
• Experience with one or more of the following: Java, JavaScript, Python, Lua.
• Bachelor””s degree in Computer Science, a related technical field involving software/systems engineering, or equivalent practical experience.
• A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
• Understanding of enterprise workloads
• Experience with algorithms and data structures
• Ability to debug, optimize code and automate routine tasks
– provided by Dice
Apply Here
For Remote Sr. Site Reliability Engineer roles, visit Remote Sr. Site Reliability Engineer Roles

********

Site Reliability Engineer at Dttl (Indeed)

Location: New York

We are looking for a brilliant Site Reliability Engineer to join our growing team at DTTL (Indeed) in Atlanta, GA.
Growing your career as a Full Time Site Reliability Engineer is an amazing opportunity to develop relevant skills.
If you are strong in adaptability, persuasion and have the right initiative for the job, then apply for the position of Site Reliability Engineer at DTTL (Indeed) today

Do you thrive on developing creative and innovative insights to solve complex challenges? Want to work on next-generation, cutting-edge products and services that deliver outstanding value and that are global in vision and scope? Work with other experts in your field? Work for a world-class organization that provides an exceptional career experience with an inclusive and collaborative culture?

Want to make an impact that matters? Consider Deloitte Global.

Work you’ll do:

We are looking for an Information Technology individual to focus on the Reliability of the Private and Public cloud applications. You will formulate, deliver, and/or manage assigned projects and work with other stakeholders or IT teams across Deloitte. You will have an opportunity to build our next-generation Cloud platform on a global level.

You are expected to have the technical expertise to drive technology strategy, execution, and improvements across the Continuous Integration / Continuous Delivery Pipeline. You will be working to modernize IT operations using Infrastructure as Code, Monitoring as Code, and build & deliver solutions to achieve the desired results.

We want someone who relentlessly pursues excellence, has the deep and broad technical expertise, and can build relationships across teams.

Responsibilities:
• Follow the team’s standards to create and manage an automated DevOps release management pipeline that delivers tooling for next-generation application development efforts and ongoing production operations. Cultivate a Continuous Integration/Continuous Delivery mindset
• Work with DevOps engineers and cloud architects to support development teams with a full set of ALM tools by leading the establishment of the right tooling and processes that will result in a fully automated release management pipeline to include: the automated build process, environment setups, testing scripts, deployments, and production operational metrics/debugging information (to target developers)
• Partner with development and operations teams to facilitate practical automation solutions and custom modules. Troubleshoot automation issues and when required engage additional resources to find practical solutions that move projects forward.
• Be an automation and tooling advisor by providing objective, practical and relevant insights, and advice
• Deliver assignments based on project objectives and support projects to completion. Ensure deliverables are completed within target timeframes and are of high quality.
• Support established KPIs to ensure performance is measured against expected business outcomes
• Work with teams to bring continuous improvement to DevOps processes and tools

What you’ll be part of – our Deloitte Global Culture:

At Deloitte, we expect results. Incredible-tangible-results. And Deloitte Global professionals play a unique role in delivering those results. We reach across disciplines and borders to serve our global organization. We are the engine of Deloitte. We develop and implement global strategies and provide programs and services that unite our network.

In Deloitte Global, everyone has opportunities. We see the importance of your perspective and your ability to create value. We want you to fit in-with an inclusive culture, focus on work-life fit and well-being, and a supportive, connected environment; but we also want you to stand out-with opportunities to have a strategic impact, innovate, and take the risks necessary to make your mark. Deloitte Technology Services works at the forefront of technology development and processes to support and protect Deloitte around the world. In this truly global environment, we operate not in “what is” but rather “what can be” to help Deloitte deliver and connect with its clients, its communities, and one another in ways not previously conceived. Required:
• Bachelor’s degree
• Minimum 7 years of experience in managing full application stacks from OS up through custom applications and hands-on experience using a leading cloud platform provider highly preferred (Microsoft Azure, Amazon Web Services (AWS), Google Cloud)
• Strong understanding across Cloud and infrastructure components (IaaS, PaaS & hybrid implementation) and its administration
• Strong experience with Azure DevOps & Developing templates or scripts to automate everyday developer or operations functions
• Strong experience with PowerShell scripting
• Strong experience with Azure DevOps
• Strong experience with Windows platform, IIS
• Working knowledge of Office 365 and Azure, Azure Active Directory, MFA and/or other directory services
• Strong experience with instrumentation, monitoring, alerting and responding relative to performance and availability of applications
• Experience with instrumentation, monitoring, alerting, and responding relative to performance and availability of applications
• Experience with deploying tools across the Continuous Integration / Continuous Deployment pipeline (examples include Git/SVN, Maven/Ant or similar)
• Experience with automation/configuration management using DevOps tools such as Puppet, Chef, Ansible or New Relic
• Knows what is possible using latest networking, infrastructure, database, and application technologies to driving automation and reliability improvements

Preferred:
• Experience with Docker, and container orchestration with Kubernetes, Mesos, Docker Swarm, or equivalent.
• Experience with different database like SQL Server 2008R2/2012/2016, open-source databases like Mongo DB, and NoSQL
• Hands on experience in software development (PowerShell / Perl / Ruby / Python)
• Some knowledge and experience with Intelligence/Data-warehousing products specially configuring, and administering

How you’ll grow:

Deloitte Global inspires leaders at every level. We believe in investing in you, helping you embrace leadership opportunities at every step of your career, and helping you identify and hone your unique strengths. We encourage you to grow by providing formal and informal development programs, coaching and mentoring, and on-the-job challenges. We want you to ask questions, take chances, and explore the possible.

Benefits you’ll receive:

Deloitte’s Total Rewards program reflects our continued commitment to lead from the front in everything we do-that’s why we take pride in offering a comprehensive variety of programs and resources to support your health and well-being needs. We provide the benefits, competitive compensation, and recognition to help sustain your efforts in making an impact that matters.

Corporate citizenship:

Deloitte is led by a purpose: to make an impact that matters. This purpose defines who we are and extends to relationships with our clients, our people, and our communities. We believe that business has the power to inspire and transform. We focus on education, giving, skill-based volunteerism, and leadership to help drive positive social impact in our communities. #LI-Hybrid Hybrid work, remote may be an option
Company Benefits:
● Excellent benefits
● Advancement opportunities
● Attractive package
Apply Here
For Remote Site Reliability Engineer roles, visit Remote Site Reliability Engineer Roles

********

Senior Site Reliability Engineer (AWS / Ruby / Terraform) at Jobot

Location: New York

SaaS platform revolutionizing efficiency / Industry leading compensation / Hybrid work environment (2-3 days on-site) in NYC!

This Jobot Job is hosted by: Craig Rosecrans
Are you a fit? Easy Apply now by clicking the “Apply” button and sending us your resume.
Salary: $140,000 – $210,000 per year

A bit about us:

We are a fast-growing and profitable SaaS company that is revolutionizing the way global users create time for themselves! Our world is becoming more and more plugged in and our consumers are changing. We are the 1st in the Industry to develop this software solution aimed at enabling humans to be 5X more efficient. As the first SRE at our client, you’ll be collaborating closely with the VP of Engineering, Director of Engineering and Senior Engineers to identify and make changes that increase the security, reliability and performance of our system. Our clients team cares deeply about infrastructure and developer best practices. Our infrastructure is managed entirely by Terraform, we have a comprehensive monitoring and alerting in place using Datadog and ship to production multiple times each day using our chatops driven CI/CD system. We’re looking for an experienced SRE to join our team and take what’s in place today to the next level, as we scale to hundreds of thousands of users.

Why join us?
• Competitive Base Salary
• 100% company paid health plan for employees
• Equity in high-growth start-up (not in lieu of a salary)
• Flexible Hours
• Very generous PTO
• Dental and Vision, FSA, HSA
• Small team, autonomy
• Many more great perks!

Job Details

Examples of Problems You’ll Be Solving:
 Build and maintain our infrastructure (with Terraform) to run our set of applications.
 Infrastructure scaling strategy based on usage and stress metrics.
 Responsibility of our disaster recovery plan and our infrastructure fault tolerant strategy.
 Apply security patches to our AMIs on a regular basis.
 Interact as an Engineer with a wide range of technologies including AWS, Docker, Postgres, Kafka/Kinesis & Ruby.
You’re Awesome Because:
 You’re interested in security, reliability and infrastructure as code. You are looking for a role that gives you exposure to a broad range of challenges.
 You enjoy working with others, and thrive in a dynamic fast-paced environment.
 You are passionate about your work and want to contribute to building a product that impacts the software development community.
 You value mentoring juniors and sharing knowledge in a blame-free environment.
Desired Qualifications:
 4+ years AWS experience.
 4+ years Terraform and scripting languages (e.g. Python, Ruby, BASH, …).
 Proficiency in load/stress testing.
 Efficient communication skills.
 Experience with Kubernetes is a plus.

Interested in hearing more? Easy Apply now by clicking the “Apply” button.
Apply Here
For Remote Senior Site Reliability Engineer (AWS / Ruby / Terraform) roles, visit Remote Senior Site Reliability Engineer (AWS / Ruby / Terraform) Roles

********

Site Reliability Engineer at Xometry

Location: New York

We are looking for a focused Site Reliability Engineer – All Levels to join our high calibre team at Xometry in Tulsa, OK.
Growing your career as a Full Time Site Reliability Engineer – All Levels is an amazing opportunity to develop excellent skills.
If you are strong in problem-solving, cooperation and have the right drive for the job, then apply for the position of Site Reliability Engineer – All Levels at Xometry today

Xometry (NASDAQ: XMTR) powers the industries of today and tomorrow by connecting the people with big ideas to the manufacturers who can bring them to life. Xometry’s digital marketplace gives manufacturers the critical resources they need to grow their business while also making it easy for buyers at Fortune 1000 companies to tap into global manufacturing capacity.

Xometry is looking for a Principal Level DevOps Engineer/Site Reliability Engineer who is excited about containers and container orchestration with Kubernetes, understands microservices, and has a passion for infrastructure as code. This person also has a passion for building tooling that makes it easier for others to build, deploy and scale their software in a cloud environment. The ideal candidate will possess a combination of strong leadership and technical skills.
What You’ll Do
• Be the driving force behind agility, reliability, and security across the organization
• Automate all the things, always
• Build new tools and platforms when you see repeatable patterns across the team workflows
• Coach Software Engineering and Data Science teams on best practices and architectural decisions
• Own the security operations that protect our customer data while maintaining development velocity
• Obsess over feedback loops: build, measure, and improve
• Passion for resolving reliability issues and identifying strategies to mitigate repeat issues
• Passion for enabling the software engineering community to build faster with less frictionWhat We’re Looking For
• Many years of experience as a Site Reliability Engineer or DevOps engineer in an eCommerce, API based, or B2C platform company. Said differently – this isn’t your first SRE rodeo
• Strong experience with AWS (preferred), Azure, Google cloud offerings
• Excellent understanding of Internet technologies and protocols (TCP/IP, DNS, HTTP, SSL, etc.)
• Strong experience in an application environment with API fundamentals (REST and GraphQL) as well as Docker, Kubernetes, Service Mesh, and Microservices
• A master of root cause analysis, especially of complex distributed systems
• Solid understanding of large-scale complex systems from a reliability perspective
• Excellent leadership and communication skills
• Must be willing to travel to other Xometry offices on occasion and when needed If this job isn’t for you but you have a friend who may be a perfect fit – share this job with them Xometry is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status. Xometry participates in E-Verify and after a job offer is accepted, will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S.
Company Benefits:
● Excellent benefits
● Advancement opportunities
● Attractive package
Apply Here
For Remote Site Reliability Engineer roles, visit Remote Site Reliability Engineer Roles

********

Site Reliability Engineer Specialist at iFood

Location: New York

Seu Cardápio DiárioEstruturação de plataformas em nuvem (AWS) Plataformas de API Linguagens de programaçãoIngredientes que buscamosConhecimentos avancados em redes L7 (DNS) AWS relacionada a redes (ELB,ALB, Transit Gateway, VPC, ACM, Route 53 )Conhecimentos em CDN (Content Delivery Network (akamai,azion,cloudflare, cloudfront))TerraformHelmConhecimentos em NGINXProtocolos de comunicacao (HTTP, GRPC)CORSPython Nice to have:Kubernetes, containersChefKong API GatewayAPI REST (JWT, OAuth2, OpenID Connect)Golang, outras linguagens
Apply Here
For Remote Site Reliability Engineer Specialist roles, visit Remote Site Reliability Engineer Specialist Roles

********

The Tech Career Guru
We will be happy to hear your thoughts

Leave a reply

Tech Jobs Here
Logo