GPU / CUDA

https://www.youtube.com/watch?v=6wWdNg0GMV4

https://developers.google.com/machine-learning/guides/rules-of-ml

https://ai-infrastructure.org/four-steps-to-make-ml-models-run-faster-in-production/

https://github.com/ray-project/llmperf-leaderboard

Automatic Speech Recognition with NeMo - https://colab.research.google.com/github/NVIDIA/NeMo/blob/stable/tutorials/asr/ASR_with_NeMo.ipynb#scrollTo=lJz6FDU1lRzc

https://medium.com/@abonia/running-ollama-in-google-colab-free-tier-545609258453

https://aws.amazon.com/getting-started/hands-on/deploy-a-machine-learning-model-to-a-rest-api/

AWSSagemaker: https://www.youtube.com/watch?v=LkR3GNDB0HI&list=PLZoTAELRMXVONh5mHrXowH6-dgyWoC_Ew

MLOPS: https://www.youtube.com/watch?v=ZET50M20hkU&list=PLJDPDCcIcu77ILdQDGZZQc0rHswZUr-3B

https://www.youtube.com/watch?v=VokAGy8C6K4&pp=ygUGbWxmbG93

https://www.youtube.com/watch?v=xcODUk0o6tU

https://developer.apple.com/metal/

https://developer.apple.com/metal/pytorch/

https://www.youtube.com/playlist?list=PLkz_y24mlSJZJx9sQJCyFZt50S4ji1PeR

https://www.youtube.com/watch?v=ZVWg18AXXuE

https://madewithml.com/#mlops