Thinking!
I am a perpetual learner in the field of Artificial Intelligence, with 8 years of experience in machine learning, data science and software development.
- Google
- Sunnyvale, California, USA
- http://127.0.0.1:5500/index.html#
- in/ankitadeshmukh-08995
Pinned
- Qwen-3.6-35B-Model-Serving-with-LoRA-Adapter-on-TPUv6e (Public)
  Qwen 3.6 35B Model Serving with LoRA Adapter on TPUv6e
- dynamo_disaggregated_serving_gke (Public)
  This guide details the end-to-end process for deploying disaggregated inference of the Llama 3.1 70B model on Google Kubernetes Engine (GKE) using A3 Ultra (H200) nodes.