DeepSeek achieves 10x AI training efficiency using NVIDIA's PTX instead of CUDA, showcasing breakthrough optimization but with complex tradeoffs.