Logo
Visit Our Website

http://weka.io

Senior DevOps Engineer

About The Position

WEKA is architecting a new approach to the enterprise data stack built for the age of reasoning. NeuralMesh by WEKA sets the standard for agentic AI data infrastructure with a cloud and AI-native software solution that can be deployed anywhere. It transforms legacy data silos into data pipelines that dramatically increase GPU utilization and make AI model training and inference, machine learning, and other compute-intensive workloads run faster, work more efficiently, and consume less energy.

WEKA is a pre-IPO, growth-stage company on a hyper-growth trajectory. We’ve raised $375M in capital with dozens of world-class venture capital and strategic investors. We help the world’s largest and most innovative enterprises and research organizations, including 12 of the Fortune 50, achieve discoveries, insights, and business outcomes faster and more sustainably. We’re passionate about solving our customers’ most complex data challenges to accelerate intelligent innovation and business value. If you share our passion, we invite you to join us on this exciting journey.

As a Senior DevOps Engineer at Weka, your primary responsibility will be collaborating with other team members on our high-performance filesystem solution and releasing our kernel driver, which is written in C on top of Linux, as part of the Weka filesystem product. 

What you’ll be doing: Weka is at the forefront of automation, continuously striving to streamline our operations across all aspects of our infrastructure. We maintain a vast multi-cloud environment spanning AWS, GCP, Azure, and OCI, as well as a robust on-premises infrastructure. Our team is responsible for maintaining and optimizing these diverse environments.

Key Responsibilities: As a Senior DevOps Engineer, you'll play a crucial role in supporting and expanding our automation efforts. You'll be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions across our multi-cloud and on-premises environments.

Required Qualifications:

  • Strong Linux administration skills with at least 5 years of experience in production environments
  • In-depth knowledge of at least one major cloud platform (AWS, GCP, Azure, or OCI)
  • Proficiency in Python programming
  • Extensive experience with Kubernetes and container technologies like Docker
  • Solid understanding of infrastructure-as-code principles and experience with tools like Terraform
  • Familiarity with monitoring and alerting systems for DevOps operations
  • At least 5 years Prior experience as a DevOps Engineer or Site Reliability Engineer (SRE)
  • Strong problem-solving skills and ability to work in a fast-paced environment

Preferred Qualifications:

  • Experience with VMware or bare metal server management
  • Networking knowledge and experience
  • Familiarity with Go programming language
  • Experience designing and implementing CI/CD processes and build systems
  • Knowledge of security best practices for cloud and on-premises environments