NVIDIA Hackathon FAQ: https://docs.google.com/document/d/1jhw9V79gxOr8tGxkIN-hEkiN9htXye7ev0IRYoT8uEc/edit?tab=t.0
Getting Started Guide: https://docs.google.com/document/d/1W_ClcEYBsNeLvDZSYImYG1CrrSTSj7Fo_ZOE_Ev1D1k/edit?_hsenc=p2ANqtz-8B3iYZ1lPLR7sjyJT_aWpStbvE6sZYVzvcDBJCW1AaC4FnbJuodvfQDG_RNbVTQk7rd0bCdzQjoHzmWdzf3gLZ0RAcfQ&_hsmi=330844551&tab=t.0#heading=h.8eu9ji7smdv6
https://github.com/NVIDIA/NeMo-Curator/blob/main/tutorials/pretraining-data-curation/red-pajama-v2-curation-tutorial.ipynb
https://github.com/NVIDIA/NeMo-Curator
https://github.com/chrisalexiuk-nvidia/ODSC-Hackathon-Repository
https://developer.nvidia.com/blog/streamlining-data-processing-for-domain-adaptive-pretraining-with-nvidia-nemo-curator/
https://build.nvidia.com/explore/discover