Full-time Site Reliability Engineer openings in New York, United States on September 08, 2022

Site Reliability Engineer at ROKT

Location: New York

About Rokt

Description

Rokt is the global leader in ecommerce technology, helping companies seize the full potential of every transaction moment to grow revenue and acquire new customers at scale. Live Nation, Groupon, Staples, Lands’ End, Fanatics, UrbanStems, GoDaddy, Vistaprint and HelloFresh are among the more than 2,500 leading global businesses and advertisers that are using Rokt’s solutions to drive more value through every transaction by offering highly relevant messages to their customers at the moment they are most likely to convert.

With our December 2021 Series E raise of US$325M, Rokt is expanding rapidly and globally, operating in 19 countries across North America, Europe and the Asia-Pacific region with the largest office in NYC and a major R&D hub in Sydney. With annual revenues of more than US$200M and a vibrant company culture, Rokt has been listed in Great Places to Work in the US and Australia. Our award-winning culture is guided by our five core values: Smart with Humility, Own the Outcomes, Force for Good, Conquer New Frontiers, and Enjoy the Ride. These values help us attract, engage, and develop the right talent around the globe and ensure we have the right conditions to do our best work. Keen to join a fast-growing company and a vibrant culture? Learn more at .

The Rokt engineering team builds best-in-class ecommerce technology that provides personalized and relevant experiences for customers globally and empowers marketers with sophisticated, AI-driven tooling to better understand consumers. Our bespoke platform handles millions of transactions per day and considers billions of data points which give engineers the opportunity to build technology at scale, collaborate across teams and gain exposure to a wide range of technology. We are expanding rapidly in our major R&D centers in NYC and Sydney. We are passionate about using intelligent systems to improve the transaction moment for retailers everywhere. Come join us and build the future!

The Role

As a Site Reliability Engineer, you will be part of a team responsible for designing and building high levels of availability, scalability and reliability into our systems. You will become intimate with the architecture of our systems and be responsible for diving deep into code and assisting with architecture and root cause analysis workshops, working directly with feature teams.

Responsibilities
• Design, develop, test, deploy and improve code that solves real world problems
• Manage priorities, deadlines and deliverables
• Operate with autonomy in solving problems
• Collaborate with other teams
• Engage in and improve services, from inception and design through deployment and use
• Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
• Scale systems sustainably through automation
• Evolve systems by pushing for changes that improve reliability and latency
Requirements
• Bachelor’s degree or equivalent practical experience.
• 3 years hands-on experience in Site Reliability and Observability Engineering, debugging, diagnosing and correcting errors and resolving high severity incidents
• Commercial experience in one of the following languages: Java, C#, Python or Go.
• Think about systems – edge cases, failure modes, behaviors, specific implementations.
• You have hands-on development experience with cloud infrastructure and tooling (AWS, GCE, Azure, Kubernetes, Docker, CI/CD pipelines & Terraform).
• Understanding of Defensive programming, Circuit breakers, Resilience frameworks, Fault tolerance, and self-healing mechanisms of services.
• Experience working with various monitoring and alerting tools
• Strong organizational and interpersonal skills
• You have handled multiple on call shifts, and have navigated more than one incident through to the retrospective process.
• At Rokt we encourage autonomy; teams have complete ownership of their systems including building, running and monitoring. As such, you may be required to be on-call and respond to systems alerts should they arise.
• Ideas, opinions, and the ability to share them through respectful proposals, presentations, and team-wide discussions. An eagerness to work and learn in the open and share your learnings with your teammates.
• A willingness and comfort communicating remotely through chat, docs, video calls, and other collaborative online tools
Benefits
• Force for Good. We actively invest in the growth of our people and the strengthening of our communities. Our NYC office is 100% vaccinated to keep our employees and community safe and healthy. We require all Rokt’stars as well as anyone else who will be onsite at the Rokt NYC office – clients, contractors, vendors, and suppliers – to show proof of vaccination and their booster shot.
• Work with the greatest talent in town. Our recruiting process is tough. We hold a high bar because we have a high-performing, high-velocity culture – we only want the brightest and the best.
• Join a community. We believe the best things happen when we come together to solve complex problems and make meaningful connections with each other through interest groups, sports clubs, and social events.
• Accelerate your career. Develop through our global training events, Level Up investment, online training courses, and our fantastic people leaders. Take your career to Rokt’speed: grow your career in our rapidly growing company.
• Take a break. When you work hard, we know you also need to rest. We offer generous time off and parental leave policies, as well as mental health and wellness days for all employees. We also offer a paid Rokt’star Sabbatical for employees who have been with us for 3 years or more.
• Stay happy and healthy. Enjoy catered lunch 3 times a week and healthy snacks in the office. Plus join the gym on us! In the US, access generous retirement plans like a 4% dollar-for-dollar 401K matching plan and get fully funded premium health insurance for your whole family. And our NYC office is dog-friendly!
• Become a shareholder. All Rokt’stars have stock options. If we succeed, everyone enjoys the upside.
• See the world! Along with our global all-staff events in amazing locations (Phuket, Thailand in January 2020, Hawaii in May 2022), we also offer generous relocation packages for those interested in moving to another Rokt office. We have cool offices in great cities – New York, Sydney, London, Singapore, Tokyo.
• Get the best of both worlds with a hybrid workplace. We currently work 3 days a week in office, allowing you to enjoy the best of both worlds (please note: this is subject to change based on the needs of the business and some support roles still require a full time presence). One week per quarter, you also have the flexibility to work from anywhere.
• We believe in equality. Rokt is an Equal Opportunity Employer and recognizes that a diverse workforce is crucial to our success as a business. We would love you to apply for one of our open roles – irrespective of socio-economic status or background, age, gender identity, race, religion, sexual orientation, color, pregnancy, carer/family responsibilities, national and social origin, political opinion, marital, veteran, or disability status
Salary range: $140,000 – $180,000 / year
Apply Here
For Remote Site Reliability Engineer roles, visit Remote Site Reliability Engineer Roles

********

Site Reliability Engineer at American Express

Location: New York

WHO WE ARE:

Resy is a hospitality technology platform that powers restaurants around the world and a consumer-facing reservation platform for passionate diners. Since its inception in 2014, Resy has created best-in-class software that elevates dining experiences and connects restaurants to a growing network of highly-engaged diners, with the powerful backing of American Express. Resy is a go-to destination for restaurant discovery, exclusive access, original content, and chef-driven culinary events. The amazing world of restaurants is just two taps away in the Resy app and at Resy.com.

Resy’s restaurant partners use Resy OS — resyos.com – a complete end-to-end technology platform — to run their businesses smarter and more efficiently. Resy OS takes into account pre, during and post service guest touchpoints to help operators deliver the best hospitality experience. Platform features include table management, wait list functionality, customized SMS text confirmations, guest feedback tools, reports and analytics for 360-degree visibility into operational performance, ticketing, PCI Level 1 compliant third-party credit card handling, and more.

We are seeking experienced Site Reliability Engineers to join our team.

WHAT YOU’LL DO:
• Ability to collaborate with remote peers across time zones, multi-tasking ticket completion and avoiding mental blockers
• Excellent written and verbal communication skills, including technical documentation and UML diagramming
• Strong Linux fundamentals and scripting experience, preferably BASH and/or Python
• Solid understanding of DNS, TLS, and networking architecture
• Operational experience (on-call rotation, incident response)
• Experience with AWS infrastructure: EC2, ALB/ELB, S3, etc
• Experience with at least one IaC tool: Terraform, CloudFormation, CDK, Pulumi, etc.
• Experience with logging and monitoring utilities, especially Datadog
• Experience automating development workflow pipelines with CI tools: Jenkins, GitHub Actions, etc
• Working knowledge of Terraform, Packer, and general configuration management concepts
• Working knowledge of at least two of the following: NGINX, Redis, Kafka, ELK, Snowflake
• For future projects, experience with containerization & orchestration: Docker/Podman, EKS, Kubernetes, Istio, Helm, Crossplane

Resy is committed to Equal Employment Opportunity through attracting and retaining a diverse team of employees and creating an inclusive environment for all. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Currently, the Company requires that colleagues, effective March 1, must have received a booster shot against COVID-19 in order to work in or visit any of our US offices, subject to legally required accommodations. If the role you are applying for is designated as hybrid or onsite, you will be required to visit our offices.

American Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law.

We back our colleagues with the support they need to thrive, professionally and personally. That’s why we have Amex Flex, our enterprise working model that provides greater flexibility to colleagues while ensuring we preserve the important aspects of our unique in-person culture. Depending on role and business needs, colleagues will either work onsite, in a hybrid model (combination of in-office and virtual days) or fully virtually.

If the role you are applying for is designated as hybrid or onsite, you will be required to demonstrate that you have completed your primary COVID-19 vaccination series (i.e. 2 doses for Moderna/Pfizer and 1 dose for J&J), in order to work in or visit any of our offices. This requirement is subject to legally required accommodations.

US Job Seekers/Employees – Click here to view the “EEO is the Law” poster and supplement and the Pay Transparency Policy Statement.

If the links do not work, please copy and paste the following URLs in a new browser window: https://www.dol.gov/agencies/ofccp/posters to access the three posters.

********

Site Reliability Engineer at Perennial Resources International

Location: New York

Contract

Location: 120 Park Ave

Site Reliability Engineer – Enterprise Console (Consultant)

What’s in it for you:

As a Site Reliability Engineer (SRE), your mission is to improve the platform’s reliability, scalability, and performance on cloud-based infrastructure. As part of the team, you will have the opportunity to work alongside engineers with the same goals in mind and be exposed to many open-source solutions and tools. Our team also plans to upgrade our platform to use the latest version of some open-source technologies and adopt new ones, so it is a rewarding experience you can explore with us.

We’ll trust you to:
• Own the production environment running our PaaS product
• Plan, prioritize, and manage migrations while working with multiple stakeholders and ensuring continuous service availability
• Improve overall observability by implementing monitoring, metrics, logs, and Service Level Objectives (SLO)
• Write scripts in Python to automate tasks and interact with APIs
• Troubleshoot production problems as they occur, and drive the post-mortem process
• Measure current capacity, predict future capacity needs and make suggestions accordingly

You need to have:
• Proficiency in developing code in at least one high-level programming language (Java, Python, C/C++, or C#)
• 2+ years of experience working on highly available, fault-tolerant distributed systems
• Experience in all phases of the Agile and test-driven SDLC

We’d love to see:
• 3+ years of Java/Scala experience
• 1+ years of hands-on experience working with Kafka, HBase, Hadoop, and Streaming frameworks
• Familiarity with Kubernetes/Docker/containers
• Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
• A keen interest in technological advances and the ability to incorporate new technology into existing systems
• The ability to create project ideas and implement them with effective collaboration and communication

********

Site Reliability Engineer at Persado

Location: New York

Who We Are

Description

Persado is the only Motivation AI platform that enables personalized communications at scale to immediately inspire each individual to engage and act. Organizations that use Persado reach a tipping point in their ability to understand their customer, generating powerful, on-brand content and communications that drive value.

As an employer, Persado is committed to creating a place where everyone’s unique perspective is valued. We understand that our team members and our inclusive culture are what make Persado special. Persado is proud to be named on Fast Company’s World’s Most Innovative Companies list in 2020 and Built In’s Best Places To Work in 2021 & 2022.

What We Are Looking For

Persado is looking for a Site Reliability Engineer to work on maintaining and improving both customer-facing and internal systems from an efficiency and resiliency perspective. (EST or CST business hours)

What You Will Work On
• Help us ship existing and new product functionality in our SaaS products using tools such as Python, Kubernetes, AWS, etc., and make sure their performance is in alignment with business goals and trade-offs
• Free up resources and reduce waste by automating repetitive tasks
• Diagnose and mitigate problems related to reliability and performance, and learn how to reduce the risk of failure
• Communicate and share knowledge with other teams and individuals, helping your colleagues grow their skillset
• Invest in your career and your personal growth, with the help of the company’s learning and development budget, by studying subjects of interest, attending events, and generally taking care of yourself

What You Bring
• A commitment to achieving win-win outcomes across different disciplines
• Good writing skills in English
• 2 years minimum experience working in an engineering team in a technical capacity
• Experience writing production quality code, in at least one language (includes scripting languages)
• Ability to troubleshoot issues with Unix/Linux servers and networking
• Experience with effective usage of data storage systems (RDBMS, Key-Value, Warehouses, Object stores, etc.)

Also Appreciated
• Experience with configuration management (e.g. configuration-as-code)
• Experience with container orchestration (e.g. Kubernetes)
• Monitoring technologies such as Nagios and Prometheus
• Cloud computing platforms such as AWS, Azure and GCP

What We Offer

Achieve your life goals and work goals at Persado.
• Persado’s hybrid working model empowers both remote and in-office work equitably!
• Competitive and equitable compensation
• Generous benefits packages globally
• 401k matching (USA); Pension Scheme (Certain EU locations) to prepare for your future
• We encourage professional growth through our dedicated enablement and training teams, as well as on demand tools and resources
• $1250 Employee Enrichment Fund to pursue a passion or upgrade your home office!
• Structured onboarding program to ensure a confident start and long-term success for new hires!
• Strong emphasis on career development and mobility, continuous feedback loops and performance management
• Flexible time off to support work-life harmony (including Summer Fridays)
• #PersadoCares! 2 paid Volunteer days per year and $100 charitable donation match
• Robust Diversity, Inclusion and Belonging initiatives; culture month celebrations, monthly diverse speaker series, commitment to bias-free recruitment, ERGs (#culture, #mindsmatter, #parents, #women, #green, #pride and growing)!
• Recognition, Rewards and Ideas to Action programs to recognize the contributions and impact of Persadoans across the globe!

Valuing diversity at Persado means recognizing and respecting human differences and similarities. Persado is committed to diversity with respect to all aspects of employment. All decisions regarding recruitment, hiring, promotion, compensation, employee training and development, and all other terms and conditions of employment, will be made without regard to race, religious beliefs, color, gender identity, sexual orientation, marital status, physical and mental disability, age, ancestry or place of origin.

********

Site Reliability Engineer at Resy – American Express

Location: New York

Job Description

WHO WE ARE:

Resy is a hospitality technology platform that powers restaurants around the world and a consumer-facing reservation platform for passionate diners. Since its inception in 2014, Resy has created best-in-class software that elevates dining experiences and connects restaurants to a growing network of highly-engaged diners, with the powerful backing of American Express. Resy is a go-to destination for restaurant discovery, exclusive access, original content, and chef-driven culinary events. The amazing world of restaurants is just two taps away in the Resy app and at Resy.com.

Resy’s restaurant partners use Resy OS — resyos.com – a complete end-to-end technology platform — to run their businesses smarter and more efficiently. Resy OS takes into account pre, during and post service guest touchpoints to help operators deliver the best hospitality experience. Platform features include table management, wait list functionality, customized SMS text confirmations, guest feedback tools, reports and analytics for 360-degree visibility into operational performance, ticketing, PCI Level 1 compliant third-party credit card handling, and more.

We are seeking experienced Site Reliability Engineers to join our team.

WHAT YOU’LL DO:

● Ability to collaborate with remote peers across time zones, multi-tasking ticket completion and avoiding mental blockers
● Excellent written and verbal communication skills, including technical documentation and UML diagramming
● Strong Linux fundamentals and scripting experience, preferably BASH and/or Python
● Solid understanding of DNS, TLS, and networking architecture
● Operational experience (on-call rotation, incident response)
● Experience with AWS infrastructure: EC2, ALB/ELB, S3, etc
● Experience with at least one IaC tool: Terraform, CloudFormation, CDK, Pulumi, etc.
● Experience with logging and monitoring utilities, especially Datadog
● Experience automating development workflow pipelines with CI tools: Jenkins, GitHub Actions, etc
● Working knowledge of Terraform, Packer, and general configuration management concepts
● Working knowledge of at least two of the following: NGINX, Redis, Kafka, ELK, Snowflake
● For future projects, experience with containerization & orchestration: Docker/Podman, EKS, Kubernetes, Istio, Helm, Crossplane

Resy is committed to Equal Employment Opportunity through attracting and retaining a diverse team of employees and creating an inclusive environment for all. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Currently, the Company requires that colleagues, effective March 1, must have received a booster shot against COVID-19 in order to work in or visit any of our US offices, subject to legally required accommodations. If the role you are applying for is designated as hybrid or onsite, you will be required to visit our offices.

Resy is operated by RESY NETWORK, INC. (hereinafter, “we”, “our” or “us”). We respect your privacy and are committed to protecting your information. Your provision of your personal information to us is completely voluntary. “Personal information” is information that can specifically identify you. We do not collect personal information unless you submit that information to us. Please refer to our Privacy Policy for more information on how we collect and process your personal information.


********

Site Reliability Engineer at Harry’s

Location: New York

Harry’s is a next-generation consumer packaged goods company focused on expanding and strengthening our DTC brands. The engineering team launches and maintains the software that supports these brands. We value continuous improvement and learning, teamwork and collaboration, creative problem solving, and open and direct dialogue and feedback.

The engineering team at Harry’s is responsible for building full-stack services to support all e-commerce activities. These systems range from order and fulfillment systems to customer-facing websites.

About the Role

We’re looking for a Site Reliability Engineer who is interested in e-commerce and cares about implementing DevOps and SRE principles to provide a quality experience to other engineers and our customers.

This role supports all of Harry’s brands. You’ll work closely with engineering and product management to achieve business goals, while driving engineering excellence and best practices in our engineering organization. The systems and software you build will help us serve thoughtful and delightful customer experiences in a way our retail competitors cannot.

We use industry-standard technologies like CI/CD, containers, Kubernetes, Terraform and AWS. It’s OK if you are not familiar with some of these because we have a strong culture of mentorship at Harry’s and we’ll support you as you get up to speed.

What you will do:
• Partner with product and engineering teams to understand overarching business objectives and translate those into actionable plans.
• Help the SRE team define technology and business strategies that deliver iterative enhancements to the tools and processes that improve availability, observability and scalability.
• You will be a force multiplier by creating clear and complete examples, frameworks, documentation, or other work products to enhance the ability of others to work autonomously or in collaborative teams.
• Implement best practices from SRE and DevOps to build high-quality, high-value software, and be an advocate for DevOps across the engineering team.

Requirements for the role:
• A Bachelor’s degree in computer science, software engineering, or similar (or equivalent non-traditional training) and 3+ years experience
• Demonstrates knowledge of AWS, Terraform, containers, and virtualization
• Demonstrates close working relationships with other engineers through training, communication and pair programming.
• Experience in at least one programming language.

The type of engineer we’re looking for:
• You’re always seeking to improve yourself, your team, and the world around you
• You thrive on direct, honest, and supportive communication
• You are always thinking about how to help the teammates around you excel
• You work effectively autonomously and collaborate within small teams as necessary to create high quality work products.

About Harry’s

Harry’s Inc. started in 2013 with a specific goal: disrupt the shaving industry by creating an innovative, everyday product at a fair price. Since then, Harry’s has expanded to Canada and Europe, developed relationships with retailers such as Target and Walmart, expanded our grooming brand into a personal care powerhouse, launched two new brands including Flamingo and Cat Person, and made our first brand acquisition with Lume.

The key to our success? Our amazing people. From chemists, mechanical engineers, CX associates, to creative directors, sourcing managers, and logistics specialists, the Harry’s team is composed of some of the most brilliant, diverse, and humble people you’ll ever meet.

Our brands answer unmet consumer needs, but our company is a place of inclusion and innovation that attracts some of the brightest minds across industries, geographies, and backgrounds. Whether we have a team of 5 or 500, our core values and our startup mentality remain; we value continuous improvement and learning, teamwork and collaboration, creative problem solving, and open and direct dialogue and feedback.

While this role is remote-first, folks have the option of working from our beautiful, 88,000 square foot SoHo office. What will you get out of that? Bagels on Tuesdays, lunch on Wednesdays and Thursdays, and fully stocked kitchens with snacks, coffee, and drinks every day. Can’t forget the free products and the opportunity to have some meetings without Zoom (remember what 2019 was like?).

We are still requiring vaccinations for in-office employees. Reasonable accommodations due to a medical reason or sincerely held religious belief may be made.

Benefits and perks
• Medical, dental, and vision coverage
• 401k match
• Equity in Harry’s
• Unlimited PTO and flexible working hours
• Wellness and L&D stipends
• One month sabbatical after 5 years
• 16 weeks parental leave
• Fun IRL and virtual events including happy hours, team building events, and parties on our rooftop
• Free products from all of our brands

We have a mandatory COVID-19 vaccination policy.

Harry’s is committed to bringing together individuals from different backgrounds and perspectives. We strive to create an inclusive environment where everyone can thrive, feel a sense of belonging, and do great work together.

Harry’s is an Equal Opportunity Employer, providing equal employment and advancement opportunities to all individuals. We recruit, hire and promote into all job levels the most qualified applicants without regard to race, color, creed, national origin, religion, sex (including pregnancy, childbirth and related medical conditions), parental status, age, disability, genetic information, citizenship status, veteran status, gender identity or expression, transgender status, sexual orientation, marital, family or partnership status, political affiliation or activities, military service, domestic violence victim status, arrest/conviction record, sexual or reproductive health decisions, caregiver status, credit history, immigration status, unemployment status, traits historically associated with race, including but not limited to hair texture and protective hairstyles, or any other status protected under applicable federal, state and local laws. Harry’s commitment to providing equal employment opportunities extends to all aspects of employment, including job assignment, compensation, discipline and access to benefits and training.

We respect the laws enforced by the EEOC and are dedicated to going above and beyond in fostering diversity across our company.

This role can be done remotely; however, there may be location constraints based on where Harry’s is registered and able to employ individuals. Please work with your recruiter and your hiring manager to understand any location constraints. We are authorized and able to employ individuals in many, but not all, states. If you are not located in or able to work from a state where we are registered or able to employ individuals, you will not be eligible for employment. Please speak with your recruiter to learn more.

********

Site Reliability Engineer (SRE) Vice President in Securitized Product Group (SPG) Production Management at JPMorgan Chase & Co.

Location: New York

J.P. Morgan’s Securitized Products Group is engaged in a variety of activities related to consumer and real estate assets. The business originates, underwrites and trades mortgage and other asset-backed receivables, and the product mix includes residential and commercial mortgage-backed securities and loans, as well as auto, credit card, student loan and consumer receivables. The group’s capabilities include sales, trading, lending, financing, origination, capital markets, syndicate and special opportunities activities on a single platform.

As a Site Reliability Engineer (SRE), you’ll build engineering disciplines, combining software and systems to develop creative engineering solutions to operations problems. Our software development focuses on optimizing existing systems, building infrastructure, and reducing manual toil through automation. You’ll join a global team of Site Reliability Engineers (SREs) and production managers with a diverse set of perspectives who are thinking big and innovating. In this environment, you’ll take the lead on relevant projects, backed by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you’ll focus on running better production applications and systems.

Responsibilities
• Design and implement appropriate monitoring, logging, and telemetry solutions based on the system design and usage
• Design, develop, code, test, and deliver resilient software to automate manual operational work
• Review and approve software and product upgrades, change management, and release management
• Engage with the development teams throughout the software development life cycle for reliability, resiliency, and scalability
• Troubleshoot priority incidents, facilitate post-mortem discussions with other teams, and ensure permanent closure of incidents
• Coach and mentor junior SRE team members
• Contribute to and leverage other SRE communities in the firm

Required Qualifications:
• Bachelor’s degree or equivalent in a software engineering or computer science discipline
• Minimum of 8 years of working experience in information technology
• Excellent debugging, troubleshooting and communication skills
• Expertise in at least one technology stack designing, coding, testing, and delivering software
• Understanding and practice of SRE principles (SLO/SLI/SLA/Error budget)
• Experience with Unix/Shell scripting and Hudson/Jenkins for deployment
• Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
• Proficient in Linux and Windows platforms
• Proficient in Core Java, Python, C or C++
• Proficient with any RDBMS
• AWS certifications

Preferred
• Experience in information technology in the financial industry and securitized products
• Experience with APM tools like Dynatrace or AppDynamics
• Experience with Tibco EMS, Kubernetes, Docker and other cloud technologies
• Experience with monitoring technologies e.g. Geneos, Datadog
• Experience with process scheduling technologies e.g. Autosys, Control-M

JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.

We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as any mental health or physical disability needs.

The health and safety of our colleagues, candidates, clients and communities has been a top priority in light of the COVID-19 pandemic. JPMorgan Chase was awarded the “WELL Health-Safety Rating” for all of our 6,200 locations globally based on our operational policies, maintenance protocols, stakeholder engagement and emergency plans to address a post-COVID-19 environment.

As a part of our commitment to health and safety, we have implemented various COVID-related health and safety requirements for our workforce. Employees are expected to follow the Firm’s current COVID-19 or other infectious disease health and safety requirements, including local requirements. Requirements include sharing information including your vaccine card in the firm’s vaccine record tool, and may include mask wearing. Requirements may change in the future with the evolving public health landscape. JPMorgan Chase will consider accommodation requests as required by applicable law.

Equal Opportunity Employer/Disability/Veterans

********

Senior Site Reliability Engineer at Landing

Location: New York

At Coalition Inc. (Permanent), in Multiple Locations

Expires at: 2022-09-18
Remote policy: Full remote

About Us

Founded in 2017, Coalition is on a mission to solve cyber risk and create a safer digital economy where everyone can thrive.
Digital risk is now a part of every business and it’s no longer solely the domain of technical teams.

That’s why we combined comprehensive insurance with proactive cybersecurity tools to help organizations stay resilient to digital risks like cyber attacks, funds transfer fraud and much more.

Our team works collaboratively across North America and Europe to prevent security failures and provide both technical and financial help when incidents do occur.

Today, Coalition is the world’s largest commercial insurtech serving over 130,000 customers including many small businesses that rely on Coalition to help them chart a path forward in the new digital world.
As of September 2021, Coalition has raised $520 million from leading global technology investors as well as highly regarded institutional investors, including Index Ventures, Ribbit Capital, Valor Ventures, Durable Capital, T. Rowe Price Advisors, and Whale Rock Capital, valuing the company at more than $3.5 billion.

Coalition has experienced tremendous growth by helping organizations of all sizes solve real-world problems and by remaining true to our founding values of character, humility, responsibility, authenticity and diversity.

That’s why we are proud to be named one of Inc’s Best Places to Work in 2021.

About The Role

We are looking for a Senior Site Reliability Engineer (Remote) who has the experience, ability, and mental fortitude to instrument and monitor the breadth of our full platform stack (hosts, applications, and performance).

In this role you will work closely with our engineering and information security teams to enhance the automated system provisioning and deployment subsystems within codified infrastructure.
You will work with developers to create more robust and scalable services independent of cloud implementations.

You will help to isolate, trap, and respond to the inevitability of system failure and develop strategies for continuous monitoring and analysis to reduce both downtime and required manual intervention.
You will participate in the on-call rotation to maintain platform SLAs.
Our core platform is written mostly in Python.

We prefer to use the right tool for the job and make pragmatic decisions about how to scale and decouple systems as we continue to grow.

We’re looking for someone who can navigate a cloud environment (across multiple providers) and bare metal with many moving pieces and systems to help the team understand how they fit into the broader puzzle.

Responsibilities
• Ensure performance, responsiveness, scalability and automation – help us iterate faster and run smoothly
• Collaborate with team members to ensure scalable and automated services
• Review work done by other engineers
• Research, learn and improve a large scale scanning and data processing platform

Main requirements

Skills and Qualifications
• Be a part of a remote team
• At least 6 years of experience
• Distributed Systems Architectures
• Cloud Providers (AWS, Google Cloud, Azure, Digital Ocean, etc.)
• NoSQL databases, Message Queues & Streaming platforms
• Good knowledge of Linux Systems (We don’t use Windows Servers)
• Source Control Systems, e.g. Git
• Terraform and infrastructure as code (Terraform)
• Python
• Configuration Management Tools, e.g. Ansible
• Building CI/CD Pipeline
• Strong written and verbal communication skills in English

Nice to have

Bonus Points
• Elasticsearch
• Cassandra
• Vagrant
• Docker/ECS Fargate/Nomad
• Vault
• Golang

Benefits & Perks

********

Site Reliability Engineer at Genesis Global Trading, Inc.

Location: New York

About Genesis

Genesis is a global leader in institutional digital asset markets. We provide a single point of access for digital asset trading, derivatives, borrowing, lending, custody and prime brokerage services.

Genesis facilitates billions in trades, loans and transactions on a monthly basis. We have a proven track record driving results for some of the largest digital asset-focused hedge funds, quant funds, family offices, VCs, market makers and exchanges.

Join Our Team

Genesis is expanding to solve some of the toughest challenges in digital asset financial markets.

About This Role – Site Reliability Engineer (SRE)

Genesis Trading is seeking a talented Site Reliability Engineer with experience building and designing systems, monitors, tools, frameworks, and methodologies to ensure the reliability of our trading platforms. You will join the SRE team, which works closely with software development and engineering teams and is positioned as the steward of our production systems.

Responsibilities
• Design and implement a wide variety of systems that support our codebase. The primary focus is cloud-native and Kubernetes systems.
• Define and manage meaningful and actionable SLI/SLO metrics
• Recommend and execute platform changes to improve service-levels
• Build infrastructure as code templates that allow DevOps to deploy
• Manage existing and build new continuous integration pipelines
• Maintain all environments via automated patching systems
• Participate in releases and rotating on-call schedules
• Own production incident response
• Design and manage alerting to react to breaches of SLOs
• Automate platform/system recovery

Personal Attributes
• Excellent communicator
• Excellent presentation skills and strong negotiation skills
• Superior time management skills
• Proven track record of strong scope and change control

Requirements
• Bachelor’s degree in computer science or a related discipline, or equivalent work experience required.
• 3+ years of experience in SRE, DevOps, SWE or cloud architecture roles.
• Hands-on experience in Public Cloud, Terraform/Ansible/CloudFormation, PagerDuty/OpsGenie, Kubernetes, Linux, modern monitoring platforms

• AWS
• Datadog
• Grafana
• Kibana
• Bash
• Python
• Jira
• Jenkins
• Puppet
• EKS
• Git/GitLab
• Kubernetes (EKS, Fargate, ECS, kubectl)
• Infrastructure as code

Benefits
• Competitive benefits package
• Flexible time off

Why Genesis

Genesis is dedicated to creating best-in-class infrastructure for institutional investors, developing advanced products that lower barriers, increase profitability and broaden access. Our team has decades of experience at top Wall Street investment banks and Silicon Valley technology firms, and includes leading experts in blockchain, distributed computing, cryptography and cybersecurity.

Digital Asset Ecosystem

Genesis operates at the heart of the digital asset ecosystem:

→ We are a subsidiary of Digital Currency Group (DCG), the largest investor in the bitcoin and blockchain space.

→ We have an unparalleled global network at the intersection of data, finance and technology.

Accelerating Momentum

Our addressable market is expanding rapidly as new institutional investors enter the space. Recent results include:

→ 300%+ Y/Y trading volume growth

→ 400%+ Y/Y loan origination growth

→ 300%+ Y/Y active loan growth

→ 150%+ Q/Q derivatives transaction growth

Diversity And Inclusion

Genesis invests in creating a welcoming environment where everyone can feel supported and connected at work.

→ We help employees develop a deep understanding of the problems we’re trying to solve.

→ We thrive on ideas driven by a broad range of perspectives.

→ We believe diverse teams lead to better products and bigger outcomes.

Please review our Privacy Policies for CCPA and GDPR.

Genesis is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

********

Site Reliability Engineer at Landing

Location: New York

At Zapier (Permanent)

Expires at: 2022-10-23
Remote policy: Global remote

As part of this team, you’ll work on
• Designing and deploying our AWS infrastructure using infrastructure as code across multiple accounts.
• Contributing to our container orchestration clusters (EKS) and serverless functions (Lambda). Production Engineering provides compute resources as a service, and you’ll help shape what features we offer.
• Evaluating new tools and recommending technologies to the entire organization. If there’s a tool that will help us serve our customers, we’ll go get it.
• Partnering with teams to solve novel infrastructure and design problems. Service teams are responsible for keeping services running. It’s your job to help them make decisions that scale.
• Building services to integrate systems, process high-traffic workloads, and perform critical migrations.

We don’t believe in drawing a hard line between developers and SREs: if you see a part of the code you can improve, default to action and make the change.
Using site reliability principles, you’ll help fix problems at their root cause rather than just the symptoms. You’ll improve application reliability using a software engineering approach to operations. You’ll develop internal tools and systems to help engineering teams ship better software, faster. You’ll get to impact every engineering team in the organization and use a broad set of technologies. Maintaining excellent relationships and communicating effectively with teams will be crucial to your success.
Building new features and services is a big part of this role.

We continually develop and implement new ways to support our teams, understand our customers’ needs, and become experts in site reliability.

When bad things happen, you’ll have the support of your team to solve contributing causes, learn from failures, and build robust and resilient systems for our customers.
We look for the solution that automates the problem away, not one that requires manual effort.

If you’re interested in making a big impact and taking our infrastructure to the next level at a fast-growing and profitable startup, then read on.

• Our Commitment to Applicants
• Culture and Values at Zapier
• Zapier Guide to Remote Work
• Zapier Code of Conduct
• Diversity and Inclusivity at Zapier

Zapier is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.
Main requirements

You want to learn about SRE on the job, but you’ve done your homework already. You have a background in the world of systems administration, systems engineering, software development or quality assurance. You’re passionate about SRE and you have learned a lot about it already – in your previous role or on your own “pet projects”.

You’ve played around in the cloud. You’re comfortable creating and optimizing a Docker image for your app, you’ve deployed cloud infrastructure with Terraform, you know how to work with Git, you’ve worked with Grafana and some time series database to visualize metrics, you’ve done Kelsey Hightower’s “Kubernetes The Hard Way”.

You can code. You have experience with languages like Python or Go. Expertise with the fundamentals of software development goes a long way here.

You’re a great communicator. Not only do you know how to share your knowledge with the team and document things well so they can be consumed asynchronously (we do this a lot as a remote company), but you know how to communicate effectively with software and support teams.

You value our values. At Zapier, our values are at the heart of how we collaborate and how we think about our customers. In our remote setting, they help develop trust and ensure we work and collaborate together to democratize automation. You see how these values can empower meaningful work, you thrive in a collaborative setting, you are eager to continue growing and excited to be part of the team.
Things We’ve Done Recently
• Developed new methods for retaining task history
• Migrated applications and services from EC2 to Kubernetes
• Wrote custom Kubernetes controllers to improve resilience
• Created deployment pipelines in GitLab and ArgoCD
• Developed autoscaling strategies to handle bursts in workloads
• Implemented OPA to enforce policies across our Kubernetes clusters
• Deployed ProxySQL for pooling connections against MySQL databases

Benefits & Perks

The Whole Package

Location: EMEA (UTC+3)

Our flexible, distributed environment lets us work with the best people from around the world.

Zapiens live in 40+ countries, including the United Kingdom, Thailand, India, Nigeria, Taiwan, Guatemala, New Zealand, Australia, and more.

Zapier offers:
• Competitive salary and profit-sharing program
• Equity for All: Stock options (or equivalent) for every Zapien
• Healthcare + dental + vision coverage*
• Retirement plan with 4% company match*
• $2,000 annual learning stipend for use on courses, conferences, and more (your choice)
• Two annual all-company retreats
• 14 weeks paid leave for new parents of biological or adopted children
• Customized Zapiversary rewards on your 1, 3, 5, 7 and 10 year work anniversaries
• Leading-edge equipment. We set you up with an Apple laptop and provide an additional budget for you to choose other home office accessories and software you may need.
• Time to renew. We encourage Zapiens to take at least 2 weeks off each year. Most of us take 4-5 weeks, in addition to locally recognized holidays.
• Opportunity to work with Zapier’s amazing partners network

*While we take care of Zapiens around the world the best we can, healthcare and retirement plans are currently available specifically in the UK, Canada, New Zealand, Australia, and United States.

********

The Tech Career Guru