A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.