Part 1 – Identifiers
- What do content identifiers do?
- They identify content, based on some information that has been provided
- Shorthand for an exact description of a particular content
- They don't make assertions about ownership, rights, how the content can be used, etc
- They can lead to separate, more complete, metadata and descriptions from other commercial and public sources
- They should be interoperable with many other identification systems—commercial, international standard, industry standard, public/State, private, etc.
- Why are AV content identifiers important?
- Metadata is the language of humans to identify things; identifiers are the language of computer-based systems
- Without reliable identifiers, you (and your system) don't know easily what content and/or specific asset/video/clip/program/etc your catalog/rights document/cue sheet/sale bill/transfer request/encoding order/etc is referring to
- They make it completely clear which version of something you're talking about – e.g. the German subtitled or the German dubbed version of the original French release or the edited-for-Germany release
- They can increase AV workflow efficiency
- They can increase accuracy of AV measurement, tracking, and reporting
- Although not a right identifier in itself, they make it easier to identify AV works and, therefore, right holders thereof
- For example
- They improve efficiency
- They make it simpler for rights holders, licensees, and distributors to communicate with each other
- They reduce the costs of moving digital assets through a complex distribution chain- from creation, to post-production, to offer/retailing, to distributing, to reportingand measurement, all moving towards automation
- They simplify, automate and reduce errors in usage reporting
- They improve measurement of uptake
- They improve the management, research & tracking of things in relation to AV works (including rights).
- They can be used in relationships to group or join things
- They can be used in conjunction with other identifiers (musical works, sound recordings, books, text, robust commercial metadata sources, etc.)
- They allow connecting of information from multiple systems (archive catalogs, commercial metadata, distribution points)
- What are the requirements of an identifier?
- Unique number referring to unique content within the scope of the ID
- Adaptability :
- Coverage at the right level of granularityfor a wide range of use cases
- Flexibility and extensibility to deal with new circumstancesand use cases
- Neutral governance structure that gives a balancedvoice to the various communities it is designed to serve
- Improves stakeholder buy-in
- Allows reaction to changes in requirements and uses
- Includes content creators, registrants, users, technologists, application developers, archives, rights holders, etc.
- Authoritative :
- Supervision by recognized authority (e.g. ISOor industry-driven standards body)
- Data supplied by trusted sources :producers, archives, metadata authorities, and other key industry members.
- High degree of accuracy, and well-understood mechanisms for reporting and fixing errors
- Persistence :
- Persistence and maintenance of the standard’sspecifications
- Persistence of the registry (of identifiers with their corresponding descriptive metadata)
- Guaranty that ID relates to the same content for perpetuity
- Interoperability (because there will never be only one identifier)
- Explicit support of identifiers from other systems or domains
- Use in other formal and industry specifications (MPEG, DVB, AACS & Blu-Ray, Digital Cinema, CableLabs VOD, Ultraviolet…)
- Flexible data formats and registration methods
- Machine to machine interfaces, APIs, and web services for easy access and interoperability
- Free use and circulation of the ID itself in order to enable applications and support use cases
- Accessibility of registry
- Resolve the ID to its associated defining metadata from the central registry
- As needed for some uses, access to the registry through appropriate mechanisms (APIs, web services, cache copies, …) to enable workflow automation, integration, and other efficiencies
- Support for external services (extended metadata, distribution, reporting, etc.)
- Commercially reasonable/nominal costfor issuance, use, implementation
- What are they used for?
- Workflow management,workflow automation, distribution
- rights management (declaration, reporting, payments, …),rights tracking rights clearance
- cross-referencing, catalogue matching
- metadata/information retrieval & aggregation, asset retrieval
Current concern:
2 AV content identifiers aiming at the same purpose
- ISAN and EIDR are both established AV content identifiers
- ISAN : International Standard Audiovisual Number (ISO 15907)
- EIDR : Industry standard based on DOI International Standard (ISO 26324)
- ISAN and EIDR as identifiers have the same purpose (identifying AV works and their versions)
- But the organisations & technical infrastructure surrounding the identifiers is currently targeted at the needs and priorities of different ecosystems
- As a result, today the two identifiers are either used differently, used by different communities of stakeholders, used in parallel … or even not used at all.
- The two identifiers are today complementary rather than in full competition with each other:
- Different use cases -- Small overlap in implemented use cases
- Different user bases -- Small overlap in users
- Possible outcomes
- Merge the two into a single system that issues only one ID
- Unlikely in the short term
- Too much sunk investment by the systems and their sponsors
- Users can't change immediately
- Make both do all things for all people
- Expensive
- Requires both to expand into areas that are not currentlytheir main expertise
- Close interoperability, so that if you get one, you automatically get the other
- Get both IDs from a single registration, where needed
- For the users, no risk of making a wrong decision
- Takes advantage of the strengths of both systems
Status
- Detailed technical discussions are already underway
- Non-technical discussions (commercial, governance) have started as well
- Parallel, but interoperable systems,is not ideal, but in the short term:
- It would be much better than what we have now (pick one or the other, and worry if you did the right thing)
- It is achievable in a much shorter timescale than other solutions
- It leaves several possible future paths open for technology, financials, and governance, as well as a unified singular system
Consequences
- for new IDs
- Anyone who gets an ISAN can generate anddiscover an EIDR ID when they need it
- Anyone who gets an EIDR ID can generate and discoveran ISANwhen they need it
- For previously existing IDs
- The two systems have to do a matching and gap-filling process between the two systems
- We are already doing technical analysis on this
- As a result, no one has to worry about making a bad choice.
- You can get both at the same time, or know that if you have gotten one you can also get the other when you need it
- If you have your own identifiers, include them too, to increase interoperability with systems that are not already part of the two identifiers' ecosystems.
Example: ITV
- Currently getting ISAN
- With proposed solution, can get EIDR ID as needed for EIDR-based applications
Example: Studio
- Currently getting EIDR ID and ISANs through separate processes for different applications
- Under proposed solution can get ISAN and EIDR IDs simultaneously through a single process as needed
Generalization of Examples:
- Currently must get ISAN and EIDR IDs through separate processes
- Under proposal, can get both through single process
Part 2 -- Registries:
Identifier registries
- Identifiers need a registry backing them up
- To guarantee uniqueness
- To store the metadata
- To provide resolution, lookup, etc
- ISAN and EIDR both have this
- The “identifier registry” must be only for identification
- ISAN and EIDR both agree on this : key condition to remain a useful and effective content identifier
- More detailed databases can be built based on the identifier, but additional metadata shouldn't be in the “identifier registry”
- Other metadata belongs in its own specialized registries. Examples:
- Rights databases
- Commercial information
- Databases of music tracks in AV works
- Extended metadata on cast, crew, etc
- Still images, reviews, scripts
- This is OK as long as you have cross-referencing and the ability to automate. Example
- Broadcaster uses a common ID to build its program guide from multiple sources
- Rights society uses AV identifier as key for a list of rights holders for the work in a particular territory
- Distributed databases linked through common and broadly used standard identifiers enable rapid evolution and efficient implementation of new ideas
- Longer term, in some cases operational efficiencies may be realized if specific Registries are combined into a single registry
Rights registry
- Centralized systems vs Decentralized (distributed) systems
- Both require identifiers
- Various ongoing projects and applications :
- Global Repertoire Database
- centralized; mainly musical content, but also aiming at specific AV content such as videoclips
- Linked Content Coalition (LCC)
- Federated/distributed
- RDI (the EU implementation project of LCC)
- DCE (the UK implementation project of LCC.)
- Safe Creative registry in Spain (safecreative.org)
- FRAME (see WG1 presentation)
- …
- The rights registry is as much a legal and social problem as it is a technical one. Especially around
- Authoritativeness and certainty
- Completeness
- Conflict management (this is unavoidable)
- Trust
- Critical mass of uptake
Part 3 -- For Archives:
- There is no reason not to get an identifier, even if the rights to the content are not clear
- Neither ISAN nor EIDR implies ownership or rights of any sort
- Choosing one does not preclude using the other
- Having a standard “external” identifier (often used in connection with specific internal identifiers) can make research of all kinds easier
- Diligent search (for determining rights, orphan status, etc)
- Creation of cue sheets for included music
- Linking to other data sources(editorial content, ancillary material)
- Scholarly citations
- Assist automation ofother tasks
- Digital encoding, digital publication/distribution
- Discoverability, linked data applications
- Communication across multiple systems (internal and external)
- Separation of archival and consumer-facing data
- parental ratings, language identification, marking, fingerprinting, measurement, etc