Frontier AI models compete on capability and efficiency
StepFun's Step 3.7 Flash launches on NVIDIA GPUs for enterprise multimodal AI
LLM
StepFun released Step 3.7 Flash, a 198-billion-parameter vision-language model optimized for enterprise workflows, now deployable on NVIDIA infrastructure via TensorRT-LLM, SGLang, and vLLM with a 256k context window and native image/video support. Commerce teams can leverage this for document intelligence, financial analysis, and concurrent agentic workflows with production-ready deployment through NVIDIA NIM and Day 0 fine-tuning via NeMo Framework.
View full article →