I am a Senior AI Relation Engineer at Google DeepMind, where I'm building the first AI DevRel team to bring Google DeepMind's AI research to every developer.
My mission is to help every developer to build with AI responsibly, ethically, and successfully, leveraging everything from open models to foundation models like Gemini and Gemma, across cloud and on-device platforms.
Before joining Google DeepMind in 2025, I was a Technical Lead at Hugging Face, leading strategic collaborations and partnerships with major cloud providers AWS, Google Cloud, Azure, Cloudflare, Digial Ocean, Dell...).
At Hugging Face, I was instrumental in growing our revenue from $0 to ~$100 million in 4 years through our cloud and hardware offerings. I served as the Technical/Engineering Lead for key partnerships, creating solutions like Hugging Face Inference Endpoints to simplify LLM deployment. My focus has consistently been on making advanced AI, particularly LLMs and RLHF, accessible and practical for real-world applications.
Beyond deployment, I collaborated with Hugging Face's open-source and science teams to improve LLM accessibility and focused on leveraging RLHF for enterprise and business use cases, including Zephyr, SmolLM, StarCoder and more.
My passion for cloud computing and AI began over 10 years ago. I've designed and implemented multiple cloud-native AI architectures for various industries and was recognized as the first German AWS Machine Learning Hero in 2021. I actively share my knowledge through research, blog posts, and on LinkedIn, and X (formerly Twitter).
On my blog philschmid.de, I break down complex AI concepts, share practical tutorials, and provide insights into the latest advancements in the field, a practice I continue to this day.
Below is a list of Technologies (mostly open source frameworks, libraries and languages) I regularly use and enjoy working with. If you want to see more what i do or have done, check out my GitHub.
🤖 Machine Learning
Transformers, PyTorch, Scikit-Learn, Langchain, Weights & Bias, Deepspeed, TensorRT, Triton, ONNX.
☁️ Cloud
AWS, GCP, Azure, Kubernetes, Kubeflow, Docker, Terraform, Github Actions, CDK.
🏗️ Non-ML
Rust, Go, Remix/React-Router 7, Next.js, Svelte, Tailwind, FastAPI, Shadcn, React.