Sarvam AI is an Indian multimodal vision-language model excelling in Indian languages and local context Sarvam Vision outperformed global rivals on Indian-language OCR benchmarks with an 84.3% accuracy score The model supports text, image, and audio processing for complex workflows beyond just localisation tasks