Site Reliability Engineer

September 18

Blockdaemon

Blockchain • Web 3 • Finance

Blockdaemon is a company that provides comprehensive blockchain infrastructure services tailored for institutional clients, including financial institutions and crypto-native companies. They offer a unified platform ('daemonOS') for secure node operation, staking, wallet management, and blockchain transaction facilitation. Blockdaemon’s services allow users to build, launch, and grow blockchain-based protocols with access to real-time blockchain data, API integration, and high-security standards. Their solutions support various major blockchain protocols like Ethereum, Polkadot, and Solana, ensuring seamless integration and scalability for Web3 applications. The company is committed to facilitating the blockchain economy with reliable and compliant infrastructure solutions.

201 - 500 employees

Founded 2017

🌐 Web 3

💸 Finance

💰 $33.1M Venture Round in 2022-07

📋 Description

• Support the Blockdaemon team to ensure reliability, scalability, and performance of systems and services in a multi-cloud environment.
• Collaborate with software engineering teams to design scalable, highly available, and resilient systems.
• Implement Infrastructure as Code to manage services and deployments in a multi-cloud, multi-project configuration.
• Develop automation tools and scripts to streamline deployment, monitoring, and incident response processes.
• Configure and maintain monitoring systems; define alerting thresholds and response procedures.
• Respond to and resolve critical incidents; perform root cause analysis and implement preventive measures; participate in the on-call rotation.
• Analyze system performance metrics, identify bottlenecks, and propose optimizations for capacity planning and performance.
• Work with security teams to implement data protection, access control, and compliance; conduct security audits and vulnerability assessments.
• Document system configurations, procedures, and troubleshooting steps; share knowledge and best practices with team members.

🎯 Requirements

• Proven experience in an independent contributor role working with cloud platforms (GCP, AWS, Azure), Infrastructure-as-Code tooling (Terraform, Helm), and CI/CD orchestration platforms (GitLab CI, ArgoCD, GitHub Actions) or similar GitOps workflows.
• Excellent problem-solving skills and the ability to independently troubleshoot complex issues.
• Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams.
• Strong architectural and security mindset.
• Strong understanding of Linux/Unix systems administration and networking concepts.
• Hands-on experience configuring and running monitoring tools such as Prometheus and Grafana.
• 5+ years of experience maintaining infrastructure-as-code on Google Cloud Platform, Amazon Web Services, and Azure.
• Experience working in SOC 2 Type 1 and Type 2 certified companies.
• Proficiency in scripting and programming languages such as Bash, Golang, Python, and TypeScript.
• 2+ years of hands-on experience operating highly available Kubernetes clusters.
• Experience with incident management and resolution.
• Experience with AI development tools and related security considerations.
• Passion for the blockchain industry and decentralised systems.
• Experience with blockchain infrastructure, in either a personal or professional capacity.
