Glossary · sourc.dev

Pricing & economics

#1 You already know this

Token

You're probably spending more than you need to — here's how to fix it

#3 Worked example

Input price

This is the number your unit economics are built on

#4 Honest two sides

Output price

It is always higher than input — knowing why helps you choose

#5 Worked example

Per 1M tokens

Once you understand this unit, you can price any model in under a minute

#10 Honest two sides

Free tier

Free has a ceiling — knowing where it is means no surprise invoice

#18 Worked example

Rate limit

Hit this in production at 2am and you will never forget it exists

#26 Analogy first

Context caching

You might be paying full price for tokens the provider already has in memory

#27 Analogy first

Batch pricing

Some API calls cost half as much if you can wait

#28 Analogy first

Price per request

The number your CFO actually cares about

#29 Analogy first

Overage

The cost you did not plan for

#46 Analogy first

Batch API

Half price, same model, no latency guarantee

#48 Analogy first

Cost per query

The number your budget actually depends on

Architecture & capabilities

#2 Analogy first

Context window

It determines what your product can actually do — and what it cannot

#6 Before / after

Function calling

This is what turns a chatbot into a product that actually does things

#7 Before / after

Vision / image input

Send the image and let the model read it

#8 A to B

Streaming (SSE)

The difference between a product that feels alive and one that feels broken

#9 Before / after

Model Context Protocol (MCP)

Why AI tools suddenly started working together

#22 Analogy first

RAG

Give a model your data without retraining it — at a fraction of fine-tuning cost

#30 Analogy first

Max output tokens

Your output might be getting silently cut off

#31 Analogy first

Tool use

The feature that turns a chatbot into a software agent

#32 Analogy first

System prompt

The instruction layer your users never see

#33 Analogy first

Grounding

The difference between a model that guesses and one that cites

#34 Analogy first

Prompt engineering

Get better results from the same model at the same price

#35 Analogy first

Temperature

The knob that controls creativity vs consistency

#39 Analogy first

Agents

The word everyone uses and almost nobody defines precisely

#41 Analogy first

Structured output

Get JSON, not paragraphs

Models & training

#19 You already know this

LLM

The term behind everything on this site — and what it actually means

#20 Honest two sides

Open weights

Run it yourself or pay forever — this is the real infrastructure decision

#21 Honest two sides

Hallucination

Understanding why it happens tells you how to reduce it

#23 Honest two sides

Fine-tuning

When RAG is not enough — knowing the difference saves you months

#25 Worked example

Model parameters

The number in the model name — what 70B actually means

#40 Analogy first

Multimodal

Models that see, hear, and read — and what that costs

#42 Analogy first

Reasoning models

The models that think before they answer

#44 Analogy first

Model family

Why "Claude" means six different things

#45 Analogy first

Quantisation

Run a 70B model on a laptop

Infrastructure & integration

#14 You already know this

API

Everything you build on top of AI runs through this

#15 Analogy first

REST API

Recognise this pattern once and you understand most of the internet

#16 You already know this

API key

Leak this once and you will understand why everyone warns about it

#17 Honest two sides

SDK

Saves you hours — or locks you in. Know which before you choose.

#24 A to B

Latency

Your users measure seconds, not tokens

#36 Analogy first

API endpoint

The URL your application talks to

#37 Analogy first

Webhook

Get notified when something changes — without polling

#38 Analogy first

Throughput

How many requests your provider can actually handle

#47 Analogy first

Async vs sync

The architecture decision behind every API call

Benchmarks & evaluation

#12 Worked example

MMLU

Everyone cites this score — understanding what it measures helps you decide how …

#13 Worked example

HumanEval

Before you trust a model with your code, know what this test contains

#43 Analogy first

AI benchmarks

The numbers everyone cites and almost nobody understands

#50 Analogy first

Benchmark gaming

Why the highest score might not be the best model

Compliance & trust

#11 Analogy first

EU data residency

One wrong choice here and your contract is void

#49 Analogy first

GDPR

The regulation that determines where your AI data lives