common.header.logo.altText
Job details

Apply to similar jobs before anyone else

  • Only relevant jobs, no spam.
  • Only new postings.
  • Unsubscribe at any time.
By creating a job alert, I agree to process my data and to send me email alerts, as detailed in our Privacy Policy.
Or

Senior Ai Infrastructure Engineer

Stealth Mode Start-up

Posted 14/4/2025

Job category:

Engineering

Description:

🚀 Senior ML Infrastructure EngineerRemote | Full-Time | Competitive Salary + BenefitsJoin us in shaping the future of ML deployment and operations.We’re on the lookout for a Senior ML Infrastructure Engineer to strengthen our ML Platform team. In this role, you'll work at the crossroads of hardware-software co-design, cloud infrastructure, and ML operations, driving innovation in how large-scale ML models are deployed and optimized. If you're passionate about building scalable ML systems and want to dive deep into LLM infrastructure and inference optimization, this is the role for you.💡 What You'll Do:Architect & Optimize ML Inference Systems — Work on the core components of our proprietary ML platform, scaling and streamlining large-scale model deployments.Hardware-Software Co-Design — Push the limits of performance by bridging low-level optimizations with high-level ML frameworks.Enhance Distributed AI Systems — Leverage libraries like PyTorch, DeepSpeed, and CCL/NCCL for distributed training and inference.Extend ML Capabilities — Develop C++ extensions for Python to fine-tune performance and enable seamless integration.ML Ops & Cloud Integration — Build CI/CD pipelines, automate testing infrastructures, and ensure smooth delivery of enterprise-grade ML systems.Empower GenAI Applications — Contribute to enabling Generative AI solutions for real-world use cases.✅ What We’re Looking For:5+ years of hands-on software engineering experience, with a focus on building production-grade systems.3+ years working with ML infrastructure or LLM inference, including PyTorch internals, custom operators, and distributed AI systems.Proficiency in C++, with experience creating C++ extensions in Python.Experience with cloud platforms, containerization, and ML Ops best practices.A Bachelor’s degree in Computer Science, Engineering, or a related field—or equivalent practical experience.💎 Bonus Points If You Have:A Master’s or PhD in Computer Science, Engineering, or related fields.Familiarity with distributed inference / frameworks ( NCCL, CCL, GLOO, DeepSpeed, TorchServeHands-on experience in CI/CD pipeline management and automating testing infrastructure.Prior exposure to enabling Generative AI applications for enterprise use.🎁 What We Offer:Work on cutting-edge ML infrastructure with a talented, collaborative team.A remote-friendly work environment with flexible hours.Competitive salary and a comprehensive benefits package.Opportunities for professional growth and continuous learning.A chance to shape the next generation of ML platforms.🌟 About Us:We’re building the future of ML inference platforms, empowering organizations to deploy, scale, and manage machine learning models effortlessly. Our solutions blend intuitive user experiences with robust backend architectures, making ML deployment efficient and accessible.Ready to Build the Future of ML?To apply, submit your resume, portfolio of relevant projects, and a brief description of your experience with ML infrastructure and large-scale inference systems.We are an equal opportunity employer. Diversity drives innovation, and we’re committed to fostering an inclusive environment where all voices are heard.🚀 Let’s push the boundaries of what’s possible in ML together!

Hiring insights:

14/4/2025
  1. Jobs in India
  2. Ml Infrastructure Engineer jobs in India
  3. Senior Ai Infrastructure Engineer