Kanchipuram, Tamil Nadu
Full-Time
0–2 Years

Accelerated Computing Engineer – Entry Level

We seek a driven Accelerated Computing Engineer to join our innovative team in Vellore. This entry-level role offers a unique opportunity to work with advanced AI/ML models, accelerated computing technologies, and cloud infrastructure while collaborating on cutting-edge research and deployment projects. You will work with a variety of state-of-the-art models such as BGE-Large, Mixtral, Gemma, LLaMA, and Stable Diffusion, as well as other fine-tuned architectures, to solve real-world computing challenges through advanced AI/ML infrastructure solutions.

Job Responsibilities

● Customer Interaction & Analysis: Work closely with customers to analyze technical and business needs, translating them into robust, AI-driven solutions.
● Model Deployment & Optimization: Develop and deploy advanced AI/ML models such as LLaMA, Mixtral, Gemma, and other GenAI models while optimizing their performance for varied computing environments.
● Performance Testing & System Benchmarking: Execute advanced test scenarios and performance benchmarks across AI/ML models and distributed systems to ensure optimal performance.
● Infrastructure & Model Research: Research, configure, and maintain infrastructure solutions (using tools like TensorRT and PyTorch) supporting our models and accelerated computing workloads.
● AI/ML Model Integration: Support and deploy models such as Stable Diffusion, BGE, Mistral, and custom fine-tuned models into end-to-end pipelines for AI/ML-driven solutions.
● Automation & Process Improvements: Drive automation strategies to streamline workflows, improve testing accuracy, and optimize system performance.
● Technical Liaison: Serve as the technical bridge between customers and product development teams by tracking customer feedback and ensuring timely resolutions.
● Model Configuration & Troubleshooting: Create custom scripts, troubleshoot advanced configurations, and support tuning efforts for AI/ML model customization.

Skill Sets and Qualifications Required:

Required Skills:

● Bachelor’s or Master’s degree in Computer Science, Engineering, or related technical discipline.
● Strong foundational knowledge of AI/ML model deployment and cloud infrastructure.
● Proficiency with AI/ML frameworks & libraries, including PyTorch, TensorRT, and Triton.
● Hands-on experience deploying models such as LLaMA, Mixtral, Gemma, and Stable Diffusion.
● Familiarity with distributed computing environments and orchestration tools like Kubernetes.
● Proficiency in workflow automation, performance tuning, and large-scale system debugging.
● Understanding of cloud computing technologies and infrastructure architecture, including storage, networking, and computing paradigms.

Preferred Skills:

● Experience working with object storage technologies like AWS S3, Azure Blob Storage, and MinIO.
● Familiarity with specific AI/ML models such as Gemma-2b, Mixtral-8x7b, Mistral-7b-instruct, and other fine-tuned models.
● Expertise in GPU configuration and tuning for AI/ML workloads, including drivers and machine learning optimization strategies.
● Familiarity with serverless computing and Function as a Service (FaaS) concepts.
● Experience with infrastructure as code (IaC) and performance benchmarking methodologies.

About E2E Networks

E2E Networks is the leading hyperscaler from India with a focus on scalable Cloud GPU infrastructure, and it is listed on the National Stock Exchange (NSE). The company is known for providing accelerated cloud computing solutions, including cutting-edge Cloud GPUs such as the NVIDIA A100 and DGX Super Computing on the Cloud, making it the sole provider of advanced Cloud GPU capabilities in India.

E2E Networks' cloud computing solutions are built on the principles of affordability, assistance, accessibility, accommodative, and Atmanirbhar Bharat (self-reliant India), which are collectively referred to as the 5As of E2E Cloud. The company has been instrumental in helping India become self-reliant in cloud infrastructure by offering a true public cloud platform that is multi-region, provides smart dedicated compute, and is designed to cater to the unique needs of higher education and research institutions, enterprises, and the next generation of AI/ML startups in the country.

Our platform has further strengthened its position as the leading accelerated computing cloud platform from India by demonstrating its capabilities in AI/ML, NLP, Computer Vision, and Generative AI on its Cloud GPU and DGX platforms. The company has earned its reputation as a trusted and reliable partner of choice for higher education and research institutions, enterprises, and AI/ML startups in India as well as globally.

E2E Networks is known as the first India-based provider of low-latency, contractless cloud computing. The company's advanced cloud computing solutions, including Cloud GPUs such as the NVIDIA A100 and DGX SuperComputing, are aimed at helping India rise as an AI/ML superpower, transforming higher education, research, and enterprises across industry and academia.
