What Is Caching?

Fetching data from a database is slow. It involves disk I/O. It involves network hops.

Caching is the cheat code that makes it fast.

Simple Definition

Caching is the process of storing copies of files or data in a temporary storage location so they can be accessed more quickly.

It is based on the idea that if you asked for something once you will probably ask for it again soon.

Imagine a librarian. Without a cache she walks to the basement archive for every book request. With a cache she keeps the most popular books on her desk.

Storing data in temporary high-speed access

The cache is smaller and faster than the main database. It usually runs in memory (RAM) rather than on disk.

When an application needs data it checks the cache first. This is a “Cache Hit” (fast). If the data isn’t there it checks the database. This is a “Cache Miss” (slow).

Types of Caching

Caching happens everywhere.

Browser Cache: Your Chrome browser saves images so it doesn’t download the logo on every page refresh.
CDN (Content Delivery Network): Servers around the world store copies of your video files closer to the user.
Application Cache: Tools like Redis or Memcached store database query results in RAM.

Visualizing Caching

In a system design interview knowing where to put the cache is critical.

Placing a node between App and DB

In a System Architecture Diagram the cache sits between the Application Server and the Database.

You draw an arrow from App to Cache labeled “1. Check.” You draw an arrow from App to DB labeled “2. Fallback.”

This visual placement highlights the strategy: Protect the database.

To master performance you need these terms:

Latency: The time it takes to fetch data. Caching reduces latency.
Eviction Policy: The rule for deciding what to delete when the cache is full (e.g. LRU: Least Recently Used).
TTL (Time To Live): How long data stays in the cache before it expires.

For more on designing high-performance systems check out our System Design Guide.

What Is Caching?

Simple Definition

Storing data in temporary high-speed access

Types of Caching

Visualizing Caching

Placing a node between App and DB

Related Posts

Cracking the System Design Interview: Visualizing Solutions in Real-Time

Mastering System Design with AI: From Interviews to Architecture Reviews

Scenario: Designing a URL Shortener (Interview Question)

AI System Architecture Generator: High-Level Design (HLD) in Seconds

Simple Definition

Storing data in temporary high-speed access

Types of Caching

Visualizing Caching

Placing a node between App and DB

Related Terms

Related Posts

Cracking the System Design Interview: Visualizing Solutions in Real-Time

Mastering System Design with AI: From Interviews to Architecture Reviews

Scenario: Designing a URL Shortener (Interview Question)

AI System Architecture Generator: High-Level Design (HLD) in Seconds