Job Description
About the Job:
Company: Sarvam
Role: Machine Learning Engineer
Location: Bangalore
Experience: 2–5 Years
Job Type: Full Time
Job Description:
Sarvam is hiring a Machine Learning Engineer (Vision) in Bangalore to contribute to building India's sovereign AI ecosystem. As a fast-growing AI company backed by leading investors, Sarvam focuses on developing full-stack AI solutions spanning research, infrastructure, and applications. In this role, you will work on cutting-edge vision-language models (VLMs), contributing to the development of multimodal AI systems that can process and understand both visual and textual data. This is a unique opportunity to work on high-impact AI projects that address real-world challenges at a national scale.
As a Machine Learning Engineer, you will be involved in the complete lifecycle of AI model development, including data pipeline creation, model training, evaluation, and deployment. You will design and optimize large-scale training pipelines using GPU clusters, implement advanced transformer-based architectures, and experiment with new techniques from ongoing research. The role also includes building production-ready systems, optimizing inference performance, and ensuring scalability and reliability of AI solutions. You will work closely with cross-functional teams and clients to translate business problems into effective machine learning solutions.
Sarvam offers a dynamic, high-performance work environment where engineers are encouraged to take ownership and push the boundaries of AI innovation. You will collaborate with top researchers and engineers, working on problems such as document processing, visual search, and multimodal data understanding. This role is ideal for individuals passionate about artificial intelligence, deep learning, and building impactful AI systems that can transform industries and improve lives at scale.
Roles & Responsibilities:
- Design and implement training and fine-tuning pipelines for large vision-language models using GPU clusters and distributed systems.
- Build scalable multimodal data pipelines for data ingestion, filtering, deduplication, and quality assurance.
- Develop and experiment with advanced AI architectures and training techniques based on transformer models.
- Create evaluation frameworks, benchmarks, and automated regression tracking systems to monitor model performance.
- Optimize models for inference using techniques such as quantization, batching, and efficient serving infrastructure.
- Develop production-grade AI systems, including multimodal pipelines and retrieval-augmented workflows.
- Translate real-world business problems into machine learning tasks with appropriate data and evaluation strategies.
- Collaborate with clients to understand use cases such as document processing, visual search, and structured data extraction.
- Debug and improve deployed models, focusing on latency, accuracy, scalability, and edge-case handling.
- Ensure secure coding practices, maintain code quality, and contribute to system reliability and maintainability.
Requirements & Eligibility:
- Bachelor's degree in Computer Science, Statistics, Physics, or a related technical field.
- Strong programming skills in Python with hands-on experience in PyTorch for deep learning model development.
- Experience in training or fine-tuning large machine learning models, including debugging and optimization.
- Solid understanding of transformer architectures and modern deep learning techniques.
- Experience building and managing large-scale data pipelines for machine learning applications.
- Familiarity with distributed training frameworks such as FSDP, DeepSpeed, or Megatron-LM is an advantage.
- Knowledge of inference optimization techniques such as quantization, distillation, and efficient model serving.
- Understanding of retrieval-augmented generation (RAG) workflows and multimodal AI systems.
- Strong analytical and problem-solving skills with the ability to work in ambiguous and evolving environments.
- Experience with open-source contributions or a strong GitHub portfolio is highly desirable.
Expected Salary:
The expected salary for a Machine Learning Engineer at Sarvam in Bangalore typically ranges from ₹20 LPA to ₹45 LPA, depending on experience, expertise in deep learning and AI systems, and contributions to research or production-grade solutions. Additional benefits may include performance bonuses, stock options, and opportunities to work on cutting-edge AI innovations.


