Domain Adaptive Pretraining for Model Customization with NVIDIA

Name: Domain Adaptive Pretraining for Model Customization with NVIDIA
Start: 2025-04-03T15:00:00-04:00
End: 2025-04-03T16:00:00-04:00
Location: Online Event

Training/Workshop, Programming Languages, Research & Data Analysis

Thu, Apr 3, 2025

3 PM – 4 PM EDT (GMT-4)

33 registered

Registration

Registration is now closed (this event already took place).

Details

This workshop gives an overview of how pre-trained LLMs can be customized and applied to specific domains using domain adaptation techniques: custom tokenization, domain-adapted continued pre-training on curated domain-specific data, and supervised fine-tuning with domain-specific instructions. NVIDIA will walk through how these techniques were applied to ChipNeMo, an LLM customized for industrial chip design that was then used to build a code generation assistant, a bug summarization and analysis assistant, and an engineering chatbot assistant. Results show that these domain adaptation techniques enable significant LLM performance improvements over general-purpose base models on domain-related downstream tasks, without degradation in generic capabilities.

Format: Presentation with open Q&A

See the full PICSciE/RC spring training program or subscribe to the PICSciE/RC mailing list.

Speakers

Sugandha Sharma

Senior Generative AI Architect and Scientist

NVIDIA

Sugandha is a Senior Generative AI Architect and Scientist working on LLMs at NVIDIA. Previously, she worked as a Research Scientist at Microsoft Research. Sugandha holds a Ph.D. in Brain and Cognitive Sciences from MIT.

Hosted By

Research Computing
Co-hosted with: GradFUTURES