Groq™ First to Achieve 100 Tokens Per Second Per User on Meta AI’s Llama-2 70B, Leading All Artificial Intelligence Solutions Providers in Inference Performance

MOUNTAIN VIEW, Calif., Aug. 8, 2023 /PRNewswire/ — Groq, an artificial intelligence (AI) solutions provider, today announced it now runs the Large Language Model (LLM), Llama-2 70B, at more than 100 tokens per second (T/s) per user on a Groq LPU™, the newly defined category for Groq…