Senior Site Reliability Engineer, Infrastructure Foundations

🕒 May 13

Apply Now
Find Similar Remote Jobs

📊 Check your resume score for this job

Improve your chances of getting an interview by checking your resume score before you apply.

Logo of Wikimedia Foundation

Wikimedia Foundation

501 - 1000 employees

Founded 2003

🤝 Non-profit

📚 Education

📱 Media

💰 $2.5M Grant on 2019-09

Non-profit • Education • Media

Wikimedia Foundation is a nonprofit charitable organization dedicated to the growth, development, and distribution of free, multilingual content. It provides the essential infrastructure for free knowledge, including hosting Wikipedia, the free online encyclopedia that is created, edited, and verified by a global community of volunteers. Supported primarily through donations, Wikimedia Foundation promotes collaborative projects that aim to share knowledge reflecting human diversity and strives to protect everyone's right to access free and open knowledge.

📋 Description

• Performing day-to-day operational/DevOps tasks on Wikimedia’s public facing infrastructure (deployment, maintenance, configuration, troubleshooting) • Implementing and utilizing configuration management and deployment tools (Puppet, Kubernetes) • Leading continuous improvement, by automating the installation, configuration and maintenance of services on our platform • Work closely with product teams helping them bring scalable functionality to our users by assisting in the architectural design of new services and making them operate at scale • Participating in a 24/7 on-call rotation shared across the broader SRE team. This includes taking part in incident response, diagnosis and follow-up on system outages or alerts across Wikimedia’s production infrastructure. • Collaborating with a global, cross-functional team in an asynchronous communication environment • Mentoring peers in your areas of technical and operational strength • Ability and willingness to travel 1-2 times a year for in-person events and team meetings

🎯 Requirements

• 6+ years of experience in an SRE/Operations/DevOps role as part of a team • Experience with shell and any scripting languages used in an SRE context (Python, Go, Bash, Ruby; we primarily use Python) and configuration management tools (Puppet, Ansible; we use Puppet) • Experience designing and managing infrastructure security for large fleets of diverse services • Experience with technical response during security incidents • Experience with package management on Linux systems (we use Debian) • Strong Linux system-level troubleshooting skills • History of automating tasks and processes, identifying process gaps, and finding automation opportunities • Strong English language skills (verbal and written) and ability to work independently, as an effective part of a globally distributed team working across multiple time zones • Experience leading and participating in incident response and post-incident review rituals, with the goal of conducting root cause analysis and implementing preventive measures

🏖️ Benefits

• Competitive salary • Health insurance • Flexible working hours • Professional development opportunities

Apply Now

Similar Jobs

🕒 May 13

BCW Group

51 - 200

🔌 API

🤖 Artificial Intelligence

🌐 Web 3

Senior Systems Administrator at BCW Technologies managing large scale systems in a remote setting. Responsibilities include setup, configuration, and optimization of Linux servers while coordinating with team members.

🇺🇸 United States – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 13

Jito Labs

1 - 10

Release Engineer managing software upgrades and releases in Jito’s blockchain infrastructure team. Focusing on operational execution across various high-stakes systems and repositories.

🇺🇸 United States – Remote

💵 $180k - $200k / year

💰 $10M Series A on 2022-08

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 13

WEX

5001 - 10000

🚗 Transport

💸 Finance

💳 Fintech

SRE Architect driving AI-Powered Reliability Engineering strategy and enforcing enterprise-wide SRE standards. Overseeing the architecture and implementation of mission-critical systems for WEX.

🇺🇸 United States – Remote

💵 $200.6k - $250.4k / year

💰 $310M Post-IPO Debt on 2020-06

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

info

🕒 May 13

Guild Mortgage

1001 - 5000

💸 Finance

🏠 Real Estate

Senior DevOps Engineer responsible for designing automated build and release processes in a mortgage banking firm. Collaborating across teams to enhance system security and performance.

🇺🇸 United States – Remote

💵 $109k - $156k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🕒 May 13

FICO

1001 - 5000

💸 Finance

🤖 Artificial Intelligence

☁️ SaaS

Senior DevOps Engineer with extensive Kubernetes and AWS experience at FICO. Responsible for CI/CD pipeline architecture and cloud security compliance.

🇺🇸 United States – Remote

💵 $115.5k - $181.5k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)