Ankita Deshmukh ankdeshm

💭

Thinking!

I am a perpetual learner in the field of Artificial Intelligence, with 8 years of experience in machine learning, data science and software development.

10 followers · 1 following

Google
Sunnyvale, California, USA
http://127.0.0.1:5500/index.html#
in/ankitadeshmukh-08995

Pinned

Qwen-3.6-35B-Model-Serving-with-LoRA-Adapter-on-TPUv6e Public

Qwen 3.6 35B Model Serving with LoRA Adapter on TPUv6e
dynamo_disaggregated_serving_gke Public

This guide details the end-to-end process for deploying disaggregated inference of the Llama 3.1 70B model on Google Kubernetes Engine (GKE) using A3 Ultra (H200) nodes.
