Domain Adaptive Pretraining for Model Customization with NVIDIA

by PICSciE/Research Computing

Training/Workshop, Programming Languages, Research & Data Analysis

Thu, Apr 3, 2025

3 PM – 4 PM EDT (GMT-4)

Online Event

Details

This workshop gives an overview of how pretrained LLMs can be customized for specific domains using domain adaptation techniques: custom tokenization, domain-adaptive continued pretraining on curated domain-specific data, and supervised fine-tuning with domain-specific instructions. NVIDIA will walk through how these techniques were applied to ChipNeMo, an LLM customized for industrial chip design, which was then used to build a code generation assistant, a bug summarization and analysis assistant, and an engineering chatbot assistant. Results show that these domain adaptation techniques yield significant performance improvements over general-purpose base models on domain-related downstream tasks, without degrading generic capabilities.

Format: Presentation with open Q&A

See the full PICSciE/RC spring training program or subscribe to the PICSciE/RC mailing list.

Speakers

Sugandha Sharma

Senior Generative AI Architect and Scientist

NVIDIA

Sugandha is a Senior Generative AI Architect and Scientist working on LLMs at NVIDIA. Previously, she worked as a Research Scientist at Microsoft Research. Sugandha holds a Ph.D. in Brain and Cognitive Sciences from MIT.

Hosted By

PICSciE/Research Computing
Co-hosted with: GradFUTURES