Accelerated Computing Engineer – Entry Level

We seek a driven Accelerated Computing Engineer to join our innovative team in Vellore. This entry-level role offers a unique opportunity to work with advanced AI/ML models, accelerated computing technologies, and cloud infrastructure while collaborating on cutting-edge research and deployment projects. You will work with a variety of state-of-the-art models such as BGE-Large, Mixtral, Gemma, LLaMA, and Stable Diffusion, as well as other fine-tuned architectures, to solve real-world computing challenges through advanced AI/ML infrastructure solutions.

Job Responsibilities

● Customer Interaction & Analysis: Work closely with customers to analyze technical and business needs, translating them into robust, AI-driven solutions.

● Model Deployment & Optimization: Develop and deploy advanced AI/ML models such as LLaMA, Mixtral, Gemma, and other GenAI models while optimizing their performance for varied computing environments.

● Performance Testing & System Benchmarking: Execute advanced test scenarios and performance benchmarks across AI/ML models and distributed systems to ensure optimal performance.

● Infrastructure & Model Research: Research, configure, and maintain infrastructure solutions (using tools like TensorRT and PyTorch) supporting our models and accelerated computing workloads.

● AI/ML Model Integration: Support and deploy models such as Stable Diffusion, BGE, Mistral, and custom fine-tuned models into end-to-end pipelines for AI/ML-driven solutions.

● Automation & Process Improvements: Drive automation strategies to streamline workflows, improve testing accuracy, and optimize system performance.

● Technical Liaison: Serve as the technical bridge by collaborating with product development teams, tracking customer feedback, and ensuring timely resolutions.

● Model Configuration & Troubleshooting: Create custom scripts, troubleshoot advanced configurations, and support tuning efforts for AI/ML model customization.

Skill Sets and Qualifications Required:

Required Skills:

● Bachelor’s or Master’s degree in Computer Science, Engineering, or related technical discipline.

● Strong foundational knowledge of AI/ML model deployment and cloud infrastructure.

● Proficiency with AI/ML frameworks & libraries, including PyTorch, TensorRT, and Triton.

● Hands-on experience deploying models such as LLaMA, Mixtral, Gemma, and Stable Diffusion.

● Familiarity with distributed computing environments and orchestration tools like Kubernetes.

● Proficiency in workflow automation, performance tuning, and large-scale system debugging.

● Understanding of cloud computing technologies and infrastructure architecture, including storage, networking, and computing paradigms.

Preferred Skills:

● Experience working with object storage technologies like AWS S3, Azure Blob Storage, and MinIO.

● Familiarity with advanced AI/ML models such as Gemma-2b, Mixtral-8x7b, Mistral-7b-instruct, and other fine-tuned AI models.

● Expertise in GPU configuration and tuning for AI/ML workloads, including drivers and machine learning optimization strategies.

● Familiarity with serverless computing and Function as a Service (FaaS) concepts.

● Experience with infrastructure as code (IaC) and performance benchmarking methodologies.

Open Positions

Assistant Vice President (AVP) of Finance

The Assistant Vice President (AVP) of Finance plays a crucial role in overseeing and managing the financial functions within the organization. This leadership position involves collaborating with various departments to ensure accurate financial reporting, strategic financial planning, and compliance with regulatory requirements. The AVP of Finance will contribute to the overall financial health and success of the organization by implementing sound financial practices and driving efficiency.

New Delhi

Full-Time

Senior Manager - Finance

We are looking for a senior manager in finance with over 8 years of experience.

New Delhi

Full-Time

About E2E Networks

E2E Networks is the leading hyperscaler from India with a focus on scalable Cloud GPU infrastructure, listed on the National Stock Exchange (NSE). The company is known for providing accelerated cloud computing solutions, including cutting-edge Cloud GPUs like NVIDIA A100 GPUs and DGX Super Computing on the Cloud, making it the sole provider of advanced Cloud GPU capabilities in India.

E2E Networks' cloud computing solutions are built on the principles of affordability, assistance, accessibility, accommodation, and Atma Nirbhar Bharat (self-reliant India), which are collectively referred to as the 5As of E2E Cloud. The company has been instrumental in helping India become self-reliant in cloud infrastructure by offering a true public cloud platform that is multi-region, provides smart dedicated compute, and is designed to cater to the unique needs of Higher Education and Research, Enterprises, and the next generation of AI/ML startups in the country.

Our platform has further strengthened its position as the leading accelerated computing cloud platform from India by demonstrating its capabilities in AI/ML, NLP, Computer Vision, and Generative AI on its Cloud GPU and DGX platforms. The company has earned its reputation as a trusted and reliable partner of choice for Higher Education and Research Institutions, Enterprises, and AI/ML startups in India as well as globally.

E2E Networks was amongst the first few providers out of India providing contactless computing with low latency. The company's advanced Cloud Computing solutions, including Cloud GPUs like NVIDIA H200 & H100, are aimed at helping India rise as an AI/ML superpower, transforming Higher Education, Research, and Enterprises across industry and academia.
