According to a recent LinkedIn post, FriendliAI is promoting the availability of Gemma-4-31B-it on its Model APIs and Dedicated Endpoints, positioning it as a high-performance inference option for enterprise use. The post describes this member of Google DeepMind’s Gemma 4 family as a dense, instruction-tuned, multimodal vision-language model with configurable thinking and native function calling.
The post suggests that FriendliAI’s implementation ranks highly on the Artificial Analysis leaderboard for output speed, time-to-first-token, and end-to-end response time. Benchmark results cited include performance in agentic coding, math and reasoning, and document understanding tasks, indicating a focus on workloads such as coding agents, document extraction, and question answering.
From an investor perspective, this emphasis on speed and benchmark leadership may signal FriendliAI’s intent to compete as a specialized inference provider for advanced open-weight models. Strong performance on third-party evaluations could enhance the company’s credibility with developers and enterprise customers seeking cost-effective alternatives to proprietary LLMs.
If sustained, this positioning could support user growth on FriendliAI’s API platform and improve monetization through higher-value workloads in AI agents and data-intensive applications. However, the post does not disclose pricing, adoption metrics, or revenue impact, so the financial implications remain uncertain. They depend on how far these technical advantages translate into commercial traction.

