
Cloud & Infrastructure Engineer (REMOTE)

Stack.io
Toronto, ON
Remote
Full-time
Experienced
Company Benefits
Flexible Work
Posted 5 days ago

Please be sure to add [email protected] to your contact list to ensure delivery of all correspondence from us

⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯

About Us:

We are a fast-paced startup focused on customer discovery and building a unique, scalable platform on baremetal infrastructure. Our platform, based on Kubernetes, integrates cloud-native tools and custom-developed solutions to deliver a cutting-edge environment. We are not a public cloud wrapper; we run our own baremetal cloud, which is a crucial differentiator in our approach. Learn more about our platform here.

Our team has over 15 years of experience in the cloud infrastructure space. From operating as a private cloud provider to public cloud consulting, we have innovated and grown with the industry over the years. We continue to build platforms that make it easier for developers to deploy and run in production, so that all they need to think about is their code and their business.

Role Overview:

As a Senior Engineer, you will be responsible for architecting, developing, and maintaining our baremetal Kubernetes platform. This role demands a deep understanding of the Kubernetes ecosystem, a strong development background, and the ability to thrive in a volatile startup environment. You will be involved in every aspect of the platform's lifecycle, from design to implementation and scaling, with a focus on innovation and problem-solving.

Key Responsibilities:

  • Platform Development: Design, build, and maintain our baremetal Kubernetes platform, integrating cloud-native tools with custom-developed components.
  • Toolchain Integration: Develop and hack together tools to optimize and automate platform workflows, ensuring high scalability, reliability, and security.
  • Client Collaboration: Work closely with customers during the discovery phase to gather feedback and iterate on platform features.
  • Infrastructure Management: Oversee the deployment and management of Kubernetes clusters on baremetal infrastructure, ensuring maximum uptime and performance.
  • Leadership & Mentoring: Guide and mentor junior engineers, promoting best practices and fostering a culture of continuous improvement.
  • On-Call Rotation: Participate in a balanced on-call rotation, providing critical support during off-hours and weekends.
  • Community Engagement: Engage with the open-source community to stay ahead of industry trends and contribute to the broader ecosystem.

Required Skills & Experience:

  • Kubernetes Expertise: Extensive experience with Kubernetes, including application management with Helm and K8s Operators. Talos/Baremetal K8s experience is a plus.
  • Development Proficiency: Strong coding skills in languages commonly used in cloud-native environments (e.g., Go, Python, Bash). Ability to develop and maintain custom tools.
  • Infrastructure as Code: Experience with Terraform, Terragrunt, and similar tools for managing baremetal environments.
  • Monitoring & Logging: Hands-on experience with Prometheus, Grafana, Elasticsearch, Kibana, and similar tools for observability.
  • Linux Proficiency: Minimum of 3 years of full-time experience using Linux as your primary OS, with a deep understanding of Linux-based systems.
  • Client-Facing Experience: Proven ability to interact with clients, gather requirements, and manage expectations in a dynamic environment.
  • Problem-Solving & Innovation: Demonstrated ability to solve complex technical problems and innovate in a startup setting.

Preferred Qualifications:

  • Baremetal Cloud: Experience with managing and scaling baremetal infrastructure.
  • Certifications: Relevant certifications (e.g., CKA, CKAD, AWS) are a plus but not required.
  • Leadership: Prior experience in a leadership or mentoring role is highly desirable.
  • Startup Experience: Experience working in a startup environment, with the ability to adapt to rapid changes and shifting priorities.

Our team solves the fundamental problems of architecting, engineering, scaling, automating, securing, and maintaining availability for our clients' web application infrastructures. We continuously ask ourselves how we can make these processes more efficient, reduce errors, and maximize the uptime of their web applications.

You are responsible for technical collaboration with our clients to build and maintain their application infrastructure. To provide the foundation for this service, you engage in designing, coordinating, and implementing the infrastructure components our service uses to continuously improve scalability, reliability, and quality.

⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯⎯

Stack.io welcomes and encourages applications from people with disabilities. Accommodations are available on request for candidates taking part in all aspects of the selection process.

