https://github.com/X-rayLaser/DistributedLLM
https://docs.vllm.ai/en/latest/
https://www.youtube.com/watch?v=IRhCi8rFJDg
https://medium.com/@jain.sm/sharding-large-models-for-parallel-inference-ee19844cc44
https://www.youtube.com/watch?v=i6zVvfvIFpc
https://www.youtube.com/watch?v=JA1l96tjrs4
https://github.com/ray-project/ray
https://avik-datta-15.medium.com/how-to-setup-apache-airflow-on-hpc-cluster-ea2575764b43
https://www.youtube.com/watch?v=IRhCi8rFJDg