1001Ferramentas
๐ŸŽฌValidators

IMDb ID Validator

Validate IMDb ID format (tt followed by 7+ digits).

What is an IMDb ID and why every cinephile workflow ends up using it

The IMDb ID is the canonical unique identifier assigned by the Internet Movie Database to every title, person, company or event in its catalogue. Created in 1990 as a Usenet hobby project and acquired by Amazon in 1998, IMDb now covers more than 11 million titles and 13 million people. Because the catalogue is so large and old, the IMDb ID became the lingua franca across streaming aggregators, recommender systems and academic film studies โ€” even projects that compete with IMDb itself, like TMDb and Letterboxd, store the IMDb ID as a foreign key.

ID format: prefixes and zero-padding

Every IMDb ID is a two-letter type prefix followed by digits. The prefix never changes once issued, so it is safe to use as a routing token:

  • tt + 7 or 8 digits โ€” title (movie, series, episode, short, video game). Example: tt0111161 for The Shawshank Redemption, currently #1 on the IMDb Top 250.
  • nm + digits โ€” name of a person (actor, director, crew). Example: nm0000093 for Brad Pitt.
  • co + digits โ€” company (production, distribution, studio). Example: co0144901 for Globo Filmes.
  • ev + digits โ€” event (award ceremonies, festivals). Example: ev0000003 for the Academy Awards.
  • Less common prefixes: ch characters, ni news items, rg release groups.

Numbers are zero-padded to keep lexicographic and numeric ordering aligned. tt0000001 is Carmencita (1894, Edison studios), the very first title ever entered. New IDs are now well past tt2 million for titles and nm14 million for people, so any robust parser must accept variable length after the prefix.

^(tt|nm|co|ev|ch|ni|rg)[0-9]{7,8}$

Where IMDb IDs power downstream products

  • TMDb (The Movie Database) โ€” community-driven, free API, mirrors the IMDb ID in external_ids.imdb_id. Used by Plex, Jellyfin, Stremio, Radarr/Sonarr.
  • JustWatch โ€” Brazilian streaming search uses IMDb IDs to query Netflix, Prime Video, Globoplay, Max, Apple TV+, Disney+.
  • Trakt.tv โ€” keeps personal viewing history keyed by IMDb ID.
  • Letterboxd โ€” film social network whose database is built on TMDb and stores the IMDb ID for export and cross-reference.
  • Wikidata / Wikipedia โ€” property P345 stores the IMDb ID for every notable film entry, allowing SPARQL queries across film history.
  • Academic datasets โ€” IMDb publishes a free non-commercial dataset (title.basics.tsv.gz, name.basics.tsv.gz) widely used in recommendation system research.

APIs you should know about

IMDb itself does not offer a free public API. Three legitimate paths exist:

  • OMDb API (omdbapi.com) โ€” popular community wrapper, 1,000 free requests per day, accepts IMDb IDs directly as i=tt0111161.
  • TMDb API โ€” fully free for non-commercial use, multi-language, far richer metadata; endpoint /3/find/{imdb_id}?external_source=imdb_id bridges the two databases.
  • IMDb non-commercial datasets โ€” bulk TSV files refreshed daily, ideal for offline analytics. Commercial licence required for paid products.

Scraping the IMDb HTML site is technically possible but actively discouraged: aggressive rate limiting, captchas and Terms of Service prohibit it. APIs are always the safer route.

IMDb rating math and how it differs from other scores

The IMDb headline rating is a weighted average of all user votes, scaled 1โ€“10. To enter the Top 250, however, a film must clear a quota of votes (currently above 25,000) and is recalculated by a Bayesian formula:

WR = (v / (v + m)) * R + (m / (v + m)) * C

where R = title's average, v = votes received, m = minimum votes required, C = mean rating across all eligible titles. This dampens low-vote outliers and is why a fresh blockbuster rarely overtakes The Shawshank Redemption. Other scoring systems use very different math: Rotten Tomatoes Tomatometer is the percentage of critics with a "positive" verdict (a binary judgement aggregated as a ratio), while Metacritic computes a weighted average of critic scores normalised to 0โ€“100.

Brazilian cinema on IMDb

Brazilian films thread through IMDb thanks to international festival releases. Notable IDs: tt0317248 Cidade de Deus, tt0861739 Tropa de Elite, tt1098327 Tropa de Elite 2, tt0118842 Central do Brasil, tt21807222 Ainda Estou Aqui (2024). Streaming partners Globoplay, Crunchyroll BR and the local catalogues of Netflix and Prime Video all link out to IMDb IDs for cross-promotion and SEO.

FAQ

What is the difference between tt and nm? tt identifies a title (a film, series, episode), nm identifies a name (a person). They never overlap, so a single regex can route requests by prefix.

Is there a free IMDb API? Not directly. The closest free alternatives are OMDb (1k req/day) and TMDb (unlimited non-commercial). Both accept the IMDb ID as a lookup key.

Can the ID have fewer than 7 digits? Officially yes for very old entries (tt0000001), but always zero-padded to at least 7. Modern allocations are 7 or 8 digits.

Is IMDb data reliable? Partially. It is crowdsourced and editors approve changes, but biases (vote brigading, premature ratings) exist. For academic work prefer the snapshot datasets and timestamp your queries.

How do I find an IMDb ID from a title string? The lookup endpoint of OMDb or TMDb is the easiest. Both accept fuzzy title queries and return the canonical IMDb ID.

Related Tools