a simple example of using the vLLM runtime to launch a model endpoint on MLDE/determined
det-vllm
includes examples of launching a model on MLDE/determined using vLLMapp
includes an example gradio application to serve the model endpoints with user requests
video demo: https://youtu.be/BoykszZcRWY