PhD Research Intern, Large Language Models - 2025

NVIDIA

NVIDIA

Multiple locations
Posted on Oct 22, 2024

By submitting your resume, you’re expressing interest in one of our 2025 Large Language Models focused Internships. We’ll review resumes on an ongoing basis, and a recruiter may reach out if your experience fits one of our many internship opportunities.

NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society — from gaming to robotics, self-driving cars to life-saving healthcare, climate change to virtual worlds where we can all connect and create. We are passionate about research that pushes boundaries but also has impact in the real world. You will be part of an amazing collaborative research team that consistently publishes at the top venues in machine learning and systems.

Our internships offer an excellent opportunity to expand your career and get hands on with one of our industry leading Large Language Models Research teams. We’re seeking strategic, ambitious, hard-working, and creative individuals who are passionate about helping us tackle challenges no one else can solve.

What you'll be doing:

  • Investigate novel approaches to infuse theory-of-mind reasoning into the post- or pre-training phases of large language models

  • Collaborate with other team members, teams, and/or external researchers.

  • Transfer your research to product groups to enable new products or types of products.

  • Opportunity to publish original research.

What we need to see:

  • Currently pursuing a PhD Degree in Computer Science/Engineering, Electrical Engineering.

  • Research experience in at least one of the following areas:

  • Large Language Models – training, alignment, and evaluation

  • Foundation Models

  • Multimodal Models/Agents

  • Vision-Language Models

  • Deep Learning, Model Compression, and Acceleration Techniques

  • Pruning

  • Quantization

  • NAS

  • Efficient Backbone Architecture

  • Distillation

  • Neural Architecture Search

  • Strong research track record and publication record at top-tier conferences.

  • Excellent communication skills.

  • Excellent programming skills in some rapid prototyping environment such as Python; C++ and parallel programming (e.g., CUDA) is a plus

  • Hands-on experience with large-scale model training is a plus.

  • Knowledge of common machine learning frameworks, such as PyTorch

NVIDIA is widely considered one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world. Are you a creative and collaborative researcher with a real passion for computer graphics? If so, we want to hear from you!

The hourly rate for our interns is 30 USD - 90 USD. Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience.

You will also be eligible for Intern benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.