Part 1 – Identifiers

  • What do content identifiers do?
  • They identify content, based on some information that has been provided
  • Shorthand for an exact description of a particular content
  • They don't make assertions about ownership, rights, how the content can be used, etc
  • They can lead to separate, more complete, metadata and descriptions from other commercial and public sources
  • They should be interoperable with many other identification systems—commercial, international standard, industry standard, public/State, private, etc.
  • Why are AV content identifiers important?
  • Metadata is the language of humans to identify things; identifiers are the language of computer-based systems
  • Without reliable identifiers, you (and your system) don't know easily what content and/or specific asset/video/clip/program/etc your catalog/rights document/cue sheet/sale bill/transfer request/encoding order/etc is referring to
  • They make it completely clear which version of something you're talking about – e.g. the German subtitled or the German dubbed version of the original French release or the edited-for-Germany release
  • They can increase AV workflow efficiency
  • They can increase accuracy of AV measurement, tracking, and reporting
  • Although not a right identifier in itself, they make it easier to identify AV works and, therefore, right holders thereof
  • For example
  • They improve efficiency
  • They make it simpler for rights holders, licensees, and distributors to communicate with each other
  • They reduce the costs of moving digital assets through a complex distribution chain- from creation, to post-production, to offer/retailing, to distributing, to reportingand measurement, all moving towards automation
  • They simplify, automate and reduce errors in usage reporting
  • They improve measurement of uptake
  • They improve the management, research & tracking of things in relation to AV works (including rights).
  • They can be used in relationships to group or join things
  • They can be used in conjunction with other identifiers (musical works, sound recordings, books, text, robust commercial metadata sources, etc.)
  • They allow connecting of information from multiple systems (archive catalogs, commercial metadata, distribution points)
  • What are the requirements of an identifier?
  • Unique number referring to unique content within the scope of the ID
  • Adaptability :
  • Coverage at the right level of granularityfor a wide range of use cases
  • Flexibility and extensibility to deal with new circumstancesand use cases
  • Neutral governance structure that gives a balancedvoice to the various communities it is designed to serve
  • Improves stakeholder buy-in
  • Allows reaction to changes in requirements and uses
  • Includes content creators, registrants, users, technologists, application developers, archives, rights holders, etc.
  • Authoritative :
  • Supervision by recognized authority (e.g. ISOor industry-driven standards body)
  • Data supplied by trusted sources :producers, archives, metadata authorities, and other key industry members.
  • High degree of accuracy, and well-understood mechanisms for reporting and fixing errors
  • Persistence :
  • Persistence and maintenance of the standard’sspecifications
  • Persistence of the registry (of identifiers with their corresponding descriptive metadata)
  • Guaranty that ID relates to the same content for perpetuity
  • Interoperability (because there will never be only one identifier)
  • Explicit support of identifiers from other systems or domains
  • Use in other formal and industry specifications (MPEG, DVB, AACS & Blu-Ray, Digital Cinema, CableLabs VOD, Ultraviolet…)
  • Flexible data formats and registration methods
  • Machine to machine interfaces, APIs, and web services for easy access and interoperability
  • Free use and circulation of the ID itself in order to enable applications and support use cases
  • Accessibility of registry
  • Resolve the ID to its associated defining metadata from the central registry
  • As needed for some uses, access to the registry through appropriate mechanisms (APIs, web services, cache copies, …) to enable workflow automation, integration, and other efficiencies
  • Support for external services (extended metadata, distribution, reporting, etc.)
  • Commercially reasonable/nominal costfor issuance, use, implementation
  • What are they used for?
  • Workflow management,workflow automation, distribution
  • rights management (declaration, reporting, payments, …),rights tracking rights clearance
  • cross-referencing, catalogue matching
  • metadata/information retrieval & aggregation, asset retrieval

Current concern:

2 AV content identifiers aiming at the same purpose

  • ISAN and EIDR are both established AV content identifiers
  • ISAN : International Standard Audiovisual Number (ISO 15907)
  • EIDR : Industry standard based on DOI International Standard (ISO 26324)
  • ISAN and EIDR as identifiers have the same purpose (identifying AV works and their versions)
  • But the organisations & technical infrastructure surrounding the identifiers is currently targeted at the needs and priorities of different ecosystems
  • As a result, today the two identifiers are either used differently, used by different communities of stakeholders, used in parallel … or even not used at all.
  • The two identifiers are today complementary rather than in full competition with each other:
  • Different use cases -- Small overlap in implemented use cases
  • Different user bases -- Small overlap in users
  • Possible outcomes
  • Merge the two into a single system that issues only one ID
  • Unlikely in the short term
  • Too much sunk investment by the systems and their sponsors
  • Users can't change immediately
  • Make both do all things for all people
  • Expensive
  • Requires both to expand into areas that are not currentlytheir main expertise
  • Close interoperability, so that if you get one, you automatically get the other
  • Get both IDs from a single registration, where needed
  • For the users, no risk of making a wrong decision
  • Takes advantage of the strengths of both systems

Status

  • Detailed technical discussions are already underway
  • Non-technical discussions (commercial, governance) have started as well
  • Parallel, but interoperable systems,is not ideal, but in the short term:
  • It would be much better than what we have now (pick one or the other, and worry if you did the right thing)
  • It is achievable in a much shorter timescale than other solutions
  • It leaves several possible future paths open for technology, financials, and governance, as well as a unified singular system

Consequences

  • for new IDs
  • Anyone who gets an ISAN can generate anddiscover an EIDR ID when they need it
  • Anyone who gets an EIDR ID can generate and discoveran ISANwhen they need it
  • For previously existing IDs
  • The two systems have to do a matching and gap-filling process between the two systems
  • We are already doing technical analysis on this
  • As a result, no one has to worry about making a bad choice.
  • You can get both at the same time, or know that if you have gotten one you can also get the other when you need it
  • If you have your own identifiers, include them too, to increase interoperability with systems that are not already part of the two identifiers' ecosystems.

Example: ITV

  • Currently getting ISAN
  • With proposed solution, can get EIDR ID as needed for EIDR-based applications

Example: Studio

  • Currently getting EIDR ID and ISANs through separate processes for different applications
  • Under proposed solution can get ISAN and EIDR IDs simultaneously through a single process as needed

Generalization of Examples:

  • Currently must get ISAN and EIDR IDs through separate processes
  • Under proposal, can get both through single process

Part 2 -- Registries:

Identifier registries

  • Identifiers need a registry backing them up
  • To guarantee uniqueness
  • To store the metadata
  • To provide resolution, lookup, etc
  • ISAN and EIDR both have this
  • The “identifier registry” must be only for identification
  • ISAN and EIDR both agree on this : key condition to remain a useful and effective content identifier
  • More detailed databases can be built based on the identifier, but additional metadata shouldn't be in the “identifier registry”
  • Other metadata belongs in its own specialized registries. Examples:
  • Rights databases
  • Commercial information
  • Databases of music tracks in AV works
  • Extended metadata on cast, crew, etc
  • Still images, reviews, scripts
  • This is OK as long as you have cross-referencing and the ability to automate. Example
  • Broadcaster uses a common ID to build its program guide from multiple sources
  • Rights society uses AV identifier as key for a list of rights holders for the work in a particular territory
  • Distributed databases linked through common and broadly used standard identifiers enable rapid evolution and efficient implementation of new ideas
  • Longer term, in some cases operational efficiencies may be realized if specific Registries are combined into a single registry

Rights registry

  • Centralized systems vs Decentralized (distributed) systems
  • Both require identifiers
  • Various ongoing projects and applications :
  • Global Repertoire Database
  • centralized; mainly musical content, but also aiming at specific AV content such as videoclips
  • Linked Content Coalition (LCC)
  • Federated/distributed
  • RDI (the EU implementation project of LCC)
  • DCE (the UK implementation project of LCC.)
  • Safe Creative registry in Spain (safecreative.org)
  • FRAME (see WG1 presentation)
  • The rights registry is as much a legal and social problem as it is a technical one. Especially around
  • Authoritativeness and certainty
  • Completeness
  • Conflict management (this is unavoidable)
  • Trust
  • Critical mass of uptake

Part 3 -- For Archives:

  • There is no reason not to get an identifier, even if the rights to the content are not clear
  • Neither ISAN nor EIDR implies ownership or rights of any sort
  • Choosing one does not preclude using the other
  • Having a standard “external” identifier (often used in connection with specific internal identifiers) can make research of all kinds easier
  • Diligent search (for determining rights, orphan status, etc)
  • Creation of cue sheets for included music
  • Linking to other data sources(editorial content, ancillary material)
  • Scholarly citations
  • Assist automation ofother tasks
  • Digital encoding, digital publication/distribution
  • Discoverability, linked data applications
  • Communication across multiple systems (internal and external)
  • Separation of archival and consumer-facing data
  • parental ratings, language identification, marking, fingerprinting, measurement, etc