Experimental tagging schema for analysis of intersectionality, bias, and reception theories.

mmrl-me/mmrl-metadata

mmrl-metadata

Intro

Experimental schema for describing the diversity and identity of individuals in, and casts of, media objects. A media object is a particular media instance: a film, a season of a Netflix show, an episode of a broadcast television series, a software release, an album release, a leak of a CDC report, a document containing a year of presidential tax reports, etc.

This schema is in development, is considered experimental, and will remain versioned yet stubbornly unfixed.

Design Goals

  • Enumerate six dimensions of identity: age, sexuality, gender, nationality, ethnicity, and "z" or transhuman/human/meta categories
  • Enumerate a useful but minimal set of tags for describing each of the identity dimensions
  • Enumerate tags for common biases
  • Represent the Bechdel, Villalobos, Gagen, and DuVernay tests, with additional tagging to quantify or infer “diverse ensemble,” “fully realized lives,” and “scenery in white lives”
  • Provide a serialized form of metadata for each media object; useful for creating in-memory objects at runtime
  • Support mechanical/computational sorting and organizing of media objects by streaming platform, genre, and cast diversity
  • Support visualizing the dimensions of an individual character/role
  • Support visualizing multiple individuals as the cast of a specific media object
  • Support visualizing multiple media objects in aggregate
  • Use major/minor tagging grammars like "20-29 playing 30-39" to represent complexity
  • Explore use of lean data practices with "just enough" data to ask pre-defined data questions
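
The major/minor grammar mentioned above can be split mechanically. The following is a minimal sketch, not part of the schema: it assumes the word "playing" separates the two halves of a tag such as "20-29 playing 30-39", and the names `MajorMinorTag` and `parse_major_minor` are hypothetical.

```python
from typing import NamedTuple, Optional


class MajorMinorTag(NamedTuple):
    """One tag split into its two halves (hypothetical helper type)."""
    major: str
    minor: Optional[str]


def parse_major_minor(tag: str, separator: str = " playing ") -> MajorMinorTag:
    # Assumption: the text before the separator is the major part and the
    # text after it is the minor part. The README does not define this.
    major, found, minor = tag.partition(separator)
    return MajorMinorTag(major.strip(), minor.strip() if found else None)


compound = parse_major_minor("20-29 playing 30-39")
simple = parse_major_minor("40-49")
print(compound.major, compound.minor)
print(simple.major, simple.minor)
```

A tag without the separator is returned with `minor` set to `None`, so simple and compound tags can share one code path.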

Background


Schema Description

{
"name": "",
"name-key": "",
"name-key-sub": "",
"name-key-canonical": "",
"type": "",
"id": "",

"id-imdb": "",
"id-wikipedia": "",

"genre-tags": [""],
"bias-tags": [""],

"plot-tags": [""],
"plot-geo": [""],
"plot-date": [""],

"distribution-tags": [""],

"production-tags": [""],
"production-geo": [""],

"reception-positive-tags": [""],
"reception-negative-tags": [""],

"cast-lead-1-ethnicities": [""],
"cast-lead-1-nationalities": [""],
"cast-lead-1-genders": [""],
"cast-lead-1-sexualities": [""],
"cast-lead-1-ages": [""],
"cast-lead-1-z": [""],
"cast-lead-1-name": [""],

"cast-lead-2-ethnicities": [""],
"cast-lead-2-nationalities": [""],
"cast-lead-2-genders": [""],
"cast-lead-2-sexualities": [""],
"cast-lead-2-ages": [""],
"cast-lead-2-z": [""],
"cast-lead-2-name": [""],

"cast-lead-3-ethnicities": [""],
"cast-lead-3-nationalities": [""],
"cast-lead-3-genders": [""],
"cast-lead-3-sexualities": [""],
"cast-lead-3-ages": [""],
"cast-lead-3-z": [""],
"cast-lead-3-name": [""],

"cast-lead-4-ethnicities": [""],
"cast-lead-4-nationalities": [""],
"cast-lead-4-genders": [""],
"cast-lead-4-sexualities": [""],
"cast-lead-4-ages": [""],
"cast-lead-4-z": [""],
"cast-lead-4-name": [""],    

"cast-lead-5-ethnicities": [""],
"cast-lead-5-nationalities": [""],
"cast-lead-5-genders": [""],
"cast-lead-5-sexualities": [""],
"cast-lead-5-ages": [""],
"cast-lead-5-z": [""],
"cast-lead-5-name": [""],

"cast-lead-6-ethnicities": [""],
"cast-lead-6-nationalities": [""],
"cast-lead-6-genders": [""],
"cast-lead-6-sexualities": [""],
"cast-lead-6-ages": [""],
"cast-lead-6-z": [""],
"cast-lead-6-name": [""],    

"cast-lead-7-ethnicities": [""],
"cast-lead-7-nationalities": [""],
"cast-lead-7-genders": [""],
"cast-lead-7-sexualities": [""],
"cast-lead-7-ages": [""],
"cast-lead-7-z": [""],
"cast-lead-7-name": [""],

"cast-lead-8-ethnicities": [""],
"cast-lead-8-nationalities": [""],
"cast-lead-8-genders": [""],
"cast-lead-8-sexualities": [""],
"cast-lead-8-ages": [""],
"cast-lead-8-z": [""],
"cast-lead-8-name": [""],

"notes": [""],

"data-version": "20210317"
}
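
Because the cast fields are flattened into keys of the form cast-lead-N-dimension, a consumer will probably want to regroup them per lead at load time. The sketch below is one possible approach, not the project's own loader: `group_cast_leads` is a hypothetical helper, and the tag values in the sample record are invented for illustration.

```python
import json
import re
from collections import defaultdict

# Matches keys such as "cast-lead-3-genders" and captures (3, "genders").
CAST_KEY = re.compile(r"^cast-lead-(\d+)-(.+)$")


def group_cast_leads(record: dict) -> dict:
    """Regroup flat cast-lead-N-* keys into {N: {dimension: [tags]}}."""
    leads = defaultdict(dict)
    for key, value in record.items():
        match = CAST_KEY.match(key)
        if match:
            # Drop the empty-string placeholders used in the blank template.
            tags = [tag for tag in value if tag]
            leads[int(match.group(1))][match.group(2)] = tags
    # Keep only leads that carry at least one non-empty tag.
    return {n: dims for n, dims in sorted(leads.items()) if any(dims.values())}


# Sample record; "female" is an illustrative tag, not taken from the schema.
sample = json.loads("""
{
  "name": "Example Title",
  "cast-lead-1-genders": ["female"],
  "cast-lead-1-ages": ["30-39"],
  "cast-lead-2-genders": [""]
}
""")

leads = group_cast_leads(sample)
print(leads)
```

Leads whose fields are all still placeholders are omitted from the result, so a record with three populated leads yields three entries rather than eight.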

  • name what the media property calls itself: identifier with capitalization and typographic flourish, such as "Westworld" or "#blackAF"
  • name-key machine generated key to hydrate this persistent representation to an in-memory a60::collection object. Don't change this.
  • name-key-sub optional machine generated key for subsetting collections to a particular episode or title. Don't change this.
  • type media characteristics, such as "film", "serial", "software", "book"
  • id optional numeric identity. For a film, leave empty. For a serial, use either the season, such as "4", or the season and episode in compressed SSEE format, where 301 means Season 3 Episode 1 (the leading zero of the season is dropped)
  • id-imdb the root page of the media property's IMDB.com entry
  • id-wikipedia the root page of the media property's wikipedia.org entry
  • genre-tags union of alpha 60 genre rubric, imdb genres, wikipedia genres, and "diverse ensembles"
  • plot-tags union of alpha 60 plot tagging rubric, imdb plot keywords
  • plot-geo ostensible geography of media property
  • plot-date ostensible time period, deduced from media property, with multiple time periods separated by comma if need be
  • cast-lead-1..8-ethnicities lead ethnicity, comma separated list
  • cast-lead-1..8-nationalities lead nationality, comma separated list
  • cast-lead-1..8-genders lead gender, comma separated list
  • cast-lead-1..8-sexualities lead sexuality, comma separated list
  • cast-lead-1..8-ages lead age deciles, comma separated list
  • cast-lead-1..8-z lead form: animal, vegetable, mineral, spirit or other, comma separated list
  • distribution-tags wikipedia distributor, such as "netflix" or "disney+" or "cbs all access"
  • production-tags optional, unknown if useful, put production overflow here
  • production-geo union wikipedia country of origin and imdb production details locations
  • notes optional, overflow and space for custom or prototype tagging
  • data-version last time new data fields were added, in compressed ISO date format (YYYYMMDD), where 20200524 means 2020-05-24. Don't change this.
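
The compressed id and data-version formats described above can be decoded with a few lines of code. This is a sketch under stated assumptions: the helper names are hypothetical, and an id of one or two digits is assumed to be a season number on its own, since the README does not say how to tell a two-digit season from a season-plus-episode value.

```python
from datetime import date, datetime
from typing import Optional, Tuple


def decode_id(id_value: str) -> Tuple[Optional[int], Optional[int]]:
    """Return (season, episode) from an `id` string; either may be None."""
    if not id_value:
        # Films leave `id` empty.
        return (None, None)
    number = int(id_value)
    if len(id_value) <= 2:
        # Assumption: one or two digits is a season with no episode.
        return (number, None)
    # SSEE with the season's leading zero dropped: 301 -> season 3, episode 1.
    season, episode = divmod(number, 100)
    return (season, episode)


def decode_data_version(value: str) -> date:
    """Parse a compressed YYYYMMDD string into a date."""
    return datetime.strptime(value, "%Y%m%d").date()


print(decode_id("301"))
print(decode_id("4"))
print(decode_data_version("20210317").isoformat())
```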