Back to Glossary
I
Glossary Term

Inference.

Learn what Inference means in modern AI and large language models.

Part of speechnoun

The runtime process of feeding input through a trained AI model to produce an output — distinct from training.

Inference is what happens every time you send a prompt to ChatGPT or any AI tool: the model takes your input, runs the math, and emits a response. Inference is fast (sub-second to a few seconds), uses far less compute than training, and is where almost all production AI cost lives.

For marketers, the relevant fact is that inference is recurring cost. AI features that scale with usage scale with inference spend, which is why most tools cap free tiers and charge per-call.

Ready to close the loop?

See every term in action

Aergos tracks your AI and organic visibility across every channel, in one platform.

Not ready to talk? Audit your site free →