https://github.com/X-rayLaser/DistributedLLM

https://docs.vllm.ai/en/latest/

https://www.youtube.com/watch?v=IRhCi8rFJDg

https://medium.com/@jain.sm/sharding-large-models-for-parallel-inference-ee19844cc44

https://www.youtube.com/watch?v=i6zVvfvIFpc

https://www.youtube.com/watch?v=JA1l96tjrs4

https://github.com/ray-project/ray

https://avik-datta-15.medium.com/how-to-setup-apache-airflow-on-hpc-cluster-ea2575764b43

https://www.youtube.com/watch?v=IRhCi8rFJDg

https://www.youtube.com/watch?v=z2M8gKGYws4

https://docs.vllm.ai/en/latest/