AI Glossary

Foundation Model

A massive AI model trained on vast amounts of data, adaptable to a wide range of tasks.

TL;DR

  • A massive AI model trained on vast amounts of data, adaptable to a wide range of tasks.
  • Foundation Model shapes how organizations design controls, ownership, and operating discipline around AI.
  • Use the related terms and explanation below to connect the definition to real enterprise rollout decisions.

In Depth

A Foundation Model is a large-scale artificial intelligence model—typically a neural network with billions of parameters—that is trained on a massive quantity of unlabeled data at scale. Unlike narrow AI models designed to do one specific thing (like identifying cats in photos or predicting housing prices), foundation models are generalists. Because they have developed a deep, underlying representation of human language and logic during their initial training, they can be adapted (e.g., through prompting or fine-tuning) to perform a wide variety of downstream tasks they were not explicitly programmed to do.

Prominent examples of foundation models include OpenAI's GPT-4, Anthropic's Claude 3, and Meta's Llama 3. For the enterprise, foundation models represent a massive paradigm shift. Instead of building a separate machine learning model for sentiment analysis, another for translation, and another for summarization, an enterprise can route all of these tasks through a single foundation model via an API. This drastically reduces the time-to-market for new AI capabilities.

However, relying on foundation models introduces vendor lock-in and data privacy concerns. Because training a foundation model costs tens of millions of dollars, most enterprises must rent access to them via cloud APIs. A robust Model Governance strategy is required to ensure that the enterprise can dynamically route traffic between different foundation models based on cost, capability, and changing Terms of Service, preventing reliance on a single vendor.

Free Resource

The 1-Page AI Safety Sheet

Print this, pin it next to every screen. 10 rules your team should follow every time they use AI at work.

You get

A printable 1-page PDF with 10 clear do's and don'ts for AI use.

Free Resource

Get a Draft AI Policy in 5 Minutes

Answer 6 questions about your company. Get a real AI usage policy you can hand to legal this week.

You get

A ready-to-review AI policy document customized to your company.

Knowledge Hub

Glossary FAQs

An LLM (Large Language Model) is a specific type of foundation model trained exclusively on text. A foundation model is a broader term that can also include models trained on images, audio, or video (multimodal models).
For 99% of enterprises, no. Training a foundation model from scratch requires massive data centers, thousands of GPUs, and millions of dollars. Enterprises should instead consume existing <a href='/glossary/foundation-model'><a href='/glossary/foundation-model'>foundation models</a></a> via APIs or host open-source <a href='/glossary/foundation-model'>foundation models</a> internally.
By utilizing an AI Gateway (like Remova) that abstracts the API layer. Instead of hard-coding your applications to the OpenAI API, you connect to the gateway, which can dynamically route your prompts to OpenAI, Google, or Anthropic without changing your core code.

ENTERPRISE AI GOVERNANCE

Turn glossary concepts like Foundation Model into enforceable operating controls with Remova.

Sign Up