Jump to content

Digital object identifier

From Wikipedia, the free encyclopedia

Digital object identifier
Full nameDigital Object Identifier
AcronymDOI
OrganisationInternational DOI Foundation
IntroducedOctober 1997; 28 years ago (1997-10)[1]
Example10.1000/182
Websitewww.doi.org/the-identifier/what-is-a-doi/ Edit this at Wikidata

A digital object identifier (DOI) is a persistent identifier or handle used to uniquely identify various objects, standardized by the International Organization for Standardization (ISO).[2] DOIs are an implementation of the Handle System;[3][4] they also fit within the URI system (Uniform Resource Identifier). They are widely used to identify academic, professional, and government information, such as journal articles, research reports, data sets, and official publications.

A DOI aims to resolve to its target, the information object to which the DOI refers. This is achieved by binding the DOI to metadata about the object, such as a URL where the object is located. Thus, by being actionable and interoperable, a DOI differs from ISBNs or ISRCs which are identifiers only. The DOI system uses the indecs Content Model to represent metadata.

The DOI for a document remains fixed over the lifetime of the document, whereas its location and other metadata may change. Referring to an online document by its DOI should provide a more stable link than directly using its URL. But if its URL changes, the publisher must update the metadata for the DOI to maintain the link to the URL.[5][6][7] It is the publisher's responsibility to update the DOI database. If they fail to do so, the DOI resolves to a dead link, leaving the DOI useless.[8]

The developer and administrator of the DOI system is the International DOI Foundation (IDF), which introduced it in 2000.[9] Organizations that meet the contractual obligations of the DOI system and are willing to pay to become a member of the system can assign DOIs.[10] The DOI system is implemented through a federation of registration agencies coordinated by the IDF.[11] The cumulative number of DOIs has increased exponentially over time, from 50 million registrations in 2011 to 391 million in 2025.[12] The rate of registering organizations ("members") has also increased over time from 4,000 in 2011 to 9,500 in 2013, but the federated nature of the system means it is not immediately clear how many members there are in total today.[13] Fake registries have even appeared.[14]

Nomenclature and syntax

[edit]

A DOI is a type of Handle System handle, which takes the form of a character string divided into two parts, a prefix and a suffix, separated by a slash.

prefix/suffix

The prefix identifies the registrant of the identifier and the suffix is chosen by the registrant and identifies the specific object associated with that DOI. Most legal Unicode characters are allowed in these strings, which are interpreted in a case-insensitive manner. The prefix usually takes the form 10.NNNN, where NNNN is a number greater than or equal to 1000, whose limit depends only on the total number of registrants.[15][16] The prefix may be further subdivided with periods, like 10.NNNN.N.[17]

For example, in the DOI name 10.1000/182, the prefix is 10.1000 and the suffix is 182. The "10" part of the prefix distinguishes the handle as part of the DOI namespace, as opposed to some other Handle System namespace,[A] and the characters 1000 in the prefix identify the registrant; in this case the registrant is the International DOI Foundation itself. 182 is the suffix, or item ID, identifying a single object (in this case, the latest version of the DOI Handbook).

DOI names can identify creative works (such as texts, images, audio or video items, and software) in both electronic and physical forms, performances, and abstract works[18] such as licenses, parties to a transaction, etc.

The names can refer to objects at varying levels of detail: thus DOI names can identify a journal, an individual issue of a journal, an individual article in the journal, or a single table in that article. The choice of level of detail is left to the assigner, but in the DOI system it must be declared as part of the metadata that is associated with a DOI name, using a data dictionary based on the indecs Content Model.

Display

[edit]

The official DOI Handbook explicitly states that DOIs should be displayed on screens and in print in the format doi:10.1000/182.[19]

Contrary to the DOI Handbook, Crossref, a major DOI registration agency, recommends displaying a URL (for example, https://doi.org/10.1000/182) instead of the officially specified format.[20][21]. The DOI Foundation guarantees these URLs to be persistent[22] ie. such URLs are PURLs — providing the location of a name resolver which will redirect HTTP requests to the correct online location of the linked item.[10][23]

The Crossref recommendation is primarily based on the assumption that the DOI is being displayed without being hyperlinked to its appropriate URL—the argument being that without the hyperlink it is not as easy to copy-and-paste the full URL to actually bring up the page for the DOI, thus the entire URL should be displayed, allowing people viewing the page containing the DOI to copy-and-paste the URL, by hand, into a new window/tab in their browser in order to go to the appropriate page for the document the DOI represents.[24]

Content

[edit]

Major content of the DOI system currently includes:

In the Organisation for Economic Co-operation and Development's publication service OECD iLibrary, each table or graph in an OECD publication is shown with a DOI name that leads to an Excel file of data underlying the tables and graphs. Further development of such services is planned.[26]

Other registries include Crossref and the multilingual European DOI Registration Agency (mEDRA).[27] Since 2015, RFCs can be referenced as doi:10.17487/rfc....[28]

Features and benefits

[edit]

The IDF designed the DOI system to provide persistent identification. Each DOI name permanently and clearly identifies the object it belongs to (although when the publisher of a journal changes, sometimes all the DOIs will be changed, with the old DOIs no longer working). It also associates metadata with objects, allowing it to provide users with relevant pieces of information about the objects and their relationships. Included as part of this metadata are network actions that allow DOI names to be resolved to web locations where the objects they describe can be found. To achieve its goals, the DOI system combines the Handle System and the indecs Content Model with a social infrastructure.

The Handle System ensures that the DOI name for an object is not based on any changeable attributes of the object such as its physical location or ownership, that the attributes of the object are encoded in its metadata rather than in its DOI name, and that no two objects are assigned the same DOI name. Because DOI names are short character strings, they are human-readable, may be copied and pasted as text, and fit into the URI specification. The DOI name-resolution mechanism acts behind the scenes, so that users communicate with it in the same way as with any other web service; it is built on open architectures, incorporates trust mechanisms, and is engineered to operate reliably and flexibly so that it can be adapted to changing demands and new applications of the DOI system.[29] DOI name-resolution may be used with OpenURL to select the most appropriate among multiple locations for a given object, according to the location of the user making the request.[30] However, despite this ability, the DOI system has drawn criticism from librarians for directing users to non-free copies of documents, that would have been available for no additional fee from alternative locations.[31]

The indecs Content Model as used within the DOI system associates metadata with objects. A small kernel of common metadata is shared by all DOI names and can be optionally extended with other relevant data, which may be public or restricted. Registrants may update the metadata for their DOI names at any time, such as when publication information changes or when an object moves to a different URL.

The International DOI Foundation (IDF) oversees the integration of these technologies and operation of the system through a technical and social infrastructure. The social infrastructure of a federation of independent registration agencies offering DOI services was modelled on existing successful federated deployments of identifiers such as GS1 and ISBN.

Comparison with other identifier schemes

[edit]

A DOI name differs from commonly used Internet pointers to material, such as the Uniform Resource Locator (URL), in that it identifies an object itself as a first-class entity, rather than the specific place where the object is located at a certain time. It implements the Uniform Resource Identifier (Uniform Resource Name) concept and adds to it a data model and social infrastructure.[32]

A DOI name also differs from standard identifier registries such as the ISBN, ISRC, etc. The purpose of an identifier registry is to manage a given collection of identifiers, whereas the primary purpose of the DOI system is to make a collection of identifiers actionable and interoperable, where that collection can include identifiers from many other controlled collections.[33]

The DOI system offers persistent, semantically interoperable resolution to related current data and is best suited to material that will be used in services outside the direct control of the issuing assigner (e.g., public citation or managing content of value). It uses a managed registry (providing both social and technical infrastructure). It does not assume any specific business model for the provision of identifiers or services and enables other existing services to link to it in defined ways. Several approaches for making identifiers persistent have been proposed.

The comparison of persistent identifier approaches is difficult because they are not all doing the same thing. Imprecisely referring to a set of schemes as "identifiers" does not mean that they can be compared easily. Other "identifier systems" may be enabling technologies with low barriers to entry, providing an easy to use labeling mechanism that allows anyone to set up a new instance (examples include Persistent Uniform Resource Locator (PURL), URLs, Globally Unique Identifiers (GUIDs), etc.), but may lack some of the functionality of a registry-controlled scheme and will usually lack accompanying metadata in a controlled scheme.

The DOI system does not have this approach and should not be compared directly to such identifier schemes. Various applications using such enabling technologies with added features have been devised that meet some of the features offered by the DOI system for specific sectors (e.g., ARK).

A DOI name does not depend on the object's location and, in this way, is similar to a Uniform Resource Name (URN) or PURL but differs from an ordinary URL. URLs are often used as substitute identifiers for documents on the Internet although the same document at two different locations has two URLs. By contrast, persistent identifiers such as DOI names identify objects as first class entities: two instances of the same object would have the same DOI name. In May, 2024, an Internet Draft was introduced to register the "doi" scheme,[34]. Many experts were not aware of this draft,[35] and the latest draft has currently expired.

Resolution

[edit]

To resolve a DOI name, it may be input to a DOI resolver, such as one at the official website https://doi.org/.

DOI name resolution is provided through the Handle System, which is an infrastructure developed and operated by CNRI (Corporation for National Research Initiatives), and is freely available to any user encountering a DOI name. Resolution redirects the user from a DOI name to one or more pieces of typed data: URLs representing instances of the object, services such as e-mail, or one or more items of metadata. To the Handle System, a DOI name is a handle, and so has a set of values assigned to it and may be thought of as a record that consists of a group of fields. Each handle value must have a data type specified in its <type> field, which defines the syntax and semantics of its data. While a DOI persistently and uniquely identifies the object to which it is assigned, DOI resolution may not be persistent, due to technical and administrative issues.

Another approach, which avoids typing or copying and pasting into a resolver is to include the DOI in a document as a URL which uses the resolver as an HTTP proxy, such as https://doi.org/ (preferred)[36] or http://dx.doi.org/, both of which support HTTPS. For example, the DOI 10.1000/182 can be included in a reference or hyperlink as https://doi.org/10.1000/182. This approach allows users to click on the DOI as a normal hyperlink. Indeed, as previously mentioned, this is how Crossref recommends that DOIs always be represented (preferring HTTPS over HTTP), so that if they are cut-and-pasted into other documents, emails, etc., they will be actionable.

An interesting consequence of the fact that DOIs depend entirely on CNRI's Handle System infrastructure (whereby CNRI operates the global root servers and wrote the protocol) is that the proxy services DOI.org/<#> and hdl.handle.net/<#> are interoperable. For example, the following URIs resolve to the same publication:
https://doi.org/10.1016/S0021-9258(19)52451-6
https://hdl.handle.net/10.1016/S0021-9258(19)52451-6

There are other DOI resolvers and HTTP Proxies apart from NCRI's Handle System. At the beginning of the year 2016, a new class of alternative DOI resolvers was started by http://doai.io/ (now discontinued [37]). This service was unusual in that it tried to find a non-paywalled (often author archived) version of a title and redirected the user to that instead of the publisher's version.[38] Since then, other open-access favoring DOI resolvers have been created, notably https://oadoi.org/ in October 2016[39] (rebranded in 2017 as https://unpaywall.org/). While traditional DOI resolvers solely rely on the Handle System, alternative DOI resolvers first consult multiple Open Access resources such as institutional libraries with the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), or indexing services based in OAI-PMH, such as BASE (Bielefeld Academic Search Engine).[37][39]

An alternative to HTTP proxies is to use one of a number of add-ons and plug-ins for browsers, thereby avoiding the conversion of the DOIs to URLs,[40] which depend on domain names and may be subject to change, while still allowing the DOI to be treated as a normal hyperlink. A disadvantage of this approach for publishers is that, at least at present, most users will be encountering the DOIs in a browser, mail reader, or other software which does not have one of these plug-ins installed.

IDF organizational structure

[edit]
Logo of the foundation

The International DOI Foundation (IDF), a non-profit organization created in 1997, is the governance body of the DOI system.[41] It safeguards all intellectual property rights relating to the DOI system, manages common operational features, and supports the development and promotion of the DOI system. The IDF ensures that any improvements made to the DOI system (including creation, maintenance, registration, resolution and policymaking of DOI names) are available to any DOI registrant. It also prevents third parties from imposing additional licensing requirements beyond those of the IDF on users of the DOI system.

The IDF is controlled by a Board elected by the members of the Foundation, with an appointed Managing Agent who is responsible for co-ordinating and planning its activities. Membership is open to all organizations with an interest in electronic publishing and related enabling technologies. The IDF holds annual open meetings on the topics of DOI and related issues.

Registration agencies, appointed by the IDF, provide services to DOI registrants: they allocate DOI prefixes, register DOI names, and provide the necessary infrastructure to allow registrants to declare and maintain metadata and state data. Registration agencies are also expected to actively promote the widespread adoption of the DOI system, to cooperate with the IDF in the development of the DOI system as a whole, and to provide services on behalf of their specific user community. A list of current RAs is maintained by the International DOI Foundation. The IDF is recognized as one of the federated registrars for the Handle System by the DONA Foundation (of which the IDF is a board member), and is responsible for assigning Handle System prefixes under the top-level 10 prefix.[42]

Registration agencies generally charge a fee to assign a new DOI name; parts of these fees are used to support the IDF. The DOI system overall, through the IDF, operates on a not-for-profit cost recovery basis.

Standardization

[edit]

The DOI system is an international standard developed by the International Organization for Standardization in its technical committee on identification and description, TC46/SC9.[43] The Draft International Standard ISO/DIS 26324, Information and documentation – Digital Object Identifier System met the ISO requirements for approval. The relevant ISO Working Group later submitted an edited version to ISO for distribution as an FDIS (Final Draft International Standard) ballot,[44] which was approved by 100% of those voting in a ballot closing on 15 November 2010.[45] The final standard was published on 23 April 2012.[2]

DOI is a registered URI under the info URI scheme specified by IETF RFC 4452.[46] info:doi/ is the infoURI Namespace of Digital Object Identifiers.[47]

The DOI syntax is a NISO standard, first standardized in 2000, ANSI/NISO Z39.84-2005 Syntax for the Digital Object Identifier.[48]

The maintainers of the DOI system have registered a DOI namespace for URNs.[49]

See also

[edit]

Notes

[edit]
  1. ^ Other registries are identified by other strings at the start of the prefix.

References

[edit]
  1. ^ Morgan, Cliff (n.d.) [This article is an expanded and updated version of a presentation to the Wiley Library Advisory Board, 18 November 1998.]. "The DOI (Digital Object Identifier)". Serials. UKSG (formerly United Kingdom Serials Group). Archived from the original on 2 August 2007. Retrieved 30 September 2024.
  2. ^ a b "ISO 26324:2012(en), Information and documentation – Digital object identifier system". ISO. Archived from the original on 17 June 2016. Retrieved 20 April 2016.
  3. ^ "The Handle System". Handle.Net Registry. Archived from the original on 7 January 2023.
  4. ^ "Resources (including Factsheets)". DOI. Archived from the original on 25 December 2022.
  5. ^ Witten, Ian H.; Bainbridge, David & Nichols, David M. (2010). How to Build a Digital Library (2nd ed.). Morgan Kaufmann. pp. 352–253. ISBN 978-0-12-374857-7.
  6. ^ Langston, Marc; Tyler, James (2004). "Linking to Journal Articles in an Online Teaching Environment: The Persistent Link, DOI, and OpenURL". The Internet and Higher Education. 7 (1): 51–58. doi:10.1016/j.iheduc.2003.11.004.
  7. ^ "How the "Digital Object Identifier" Works". BusinessWeek. 23 July 2001. Archived from the original on 2 October 2010. Retrieved 20 April 2010. Assuming the publishers do their job of maintaining the databases, these centralized references, unlike current web links, should never become outdated or broken
  8. ^ Liu, Jia (2021). "Digital Object Identifier (DOI) Under the Context of Research Data Librarianship". Journal of eScience Librarianship. 10 (2) e1180. doi:10.7191/jeslib.2021.1180.
  9. ^ Paskin, Norman (2010), "Digital Object Identifier (DOI) System", Encyclopedia of Library and Information Sciences (3rd ed.), Taylor and Francis, pp. 1586–1592
  10. ^ a b Davidson, Lloyd A.; Douglas, Kimberly (December 1998). "Digital Object Identifiers: Promise and problems for scholarly publishing". Journal of Electronic Publishing. 4 (2). doi:10.3998/3336451.0004.203.
  11. ^ "Welcome to the DOI System". DOI. 28 June 2010. Archived from the original on 13 August 2010. Retrieved 7 August 2010.
  12. ^ "What is a DOI?". DOI Foundation. Retrieved 6 February 2025.
  13. ^ "Who are the Members & Users?". DOI. Retrieved 6 February 2025. "DOI News, April 2011: 1. DOI System exceeds 50 million assigned identifiers". DOI. 20 April 2011. Archived from the original on 27 July 2011. Retrieved 3 July 2011.
  14. ^ "Important Alerts". DOI.
  15. ^ "doi info & guidelines". CrossRef.org. Publishers International Linking Association, Inc. 2013. Archived from the original on 21 October 2002. Retrieved 10 June 2016. All DOI prefixes begin with "10" to distinguish the DOI from other implementations of the Handle System followed by a four-digit number or string (the prefix can be longer if necessary).
  16. ^ "Factsheet—Key Facts on Digital Object Identifier System". International DOI Foundation. 6 June 2016. Archived from the original on 5 June 2016. Retrieved 1 November 2024. Over 18,000 DOI name prefixes within the DOI System
  17. ^ "DOI Handbook—2 Numbering". International DOI Foundation. 1 February 2016. Archived from the original on 30 June 2014. Retrieved 10 June 2016. The registrant code may be further divided into sub-elements for administrative convenience if desired. Each sub-element of the registrant code shall be preceded by a full stop.
  18. ^ "Frequently asked questions about the DOI system: 6. What can a DOI name be assigned to?". DOI Foundation. 3 July 2018. Archived from the original on 16 February 2023. Retrieved 19 July 2018.
  19. ^ "DOI Handbook – Numbering". doi.org. 13 February 2014. Section 2.6.1 Screen and print presentation. Archived from the original on 30 June 2014. Retrieved 30 June 2014.
  20. ^ "DOI Display Guidelines". Archived from the original on 24 November 2016. Retrieved 19 October 2016.
  21. ^ "New Crossref DOI display guidelines are on the way". Archived from the original on 19 October 2016. Retrieved 19 October 2016.
  22. ^ "Persistence as a Design Feature of the DOI System". Archived from the original on 27 September 2025. Retrieved 6 October 2025.
  23. ^ Powell, Andy (June 1998). "Resolving DOI Based URNs Using Squid: An Experimental System at UKOLN". D-Lib Magazine. doi:10.1045/june98-powell. ISSN 1082-9873. Archived from the original on 13 June 2010. Retrieved 23 April 2010.
  24. ^ ChrissieCW. "Crossref Revises DOI Display Guidelines - Crossref". crossref.org. Archived from the original on 25 April 2018. Retrieved 25 April 2018.
  25. ^ "Japan Link Center(JaLC)". japanlinkcenter.org. Archived from the original on 29 September 2020. Retrieved 6 August 2022.
  26. ^ Green, T. (2009). "We Need Publishing Standards for Datasets and Data Tables". Research Information. doi:10.1787/603233448430.
  27. ^ "multilingual European DOI Registration Agency". mEDRA.org. 2003. Archived from the original on 1 February 2018. Retrieved 2 February 2018.
  28. ^ J. Levine (October 2015). Assigning Digital Object Identifiers to RFCs. Internet Architecture Board. doi:10.17487/RFC7669. RFC 7669. Informational. sec. 3.
  29. ^ Timmer, John (6 March 2010). "DOIs and their discontents". Ars Technica. Archived from the original on 8 March 2013. Retrieved 5 March 2013.
  30. ^ DeRisi, Susanne; Kennison, Rebecca; and Twyman, Nick (2003). "Editorial: The what and whys of DOIs". PLoS Biology. 1 (2): e57. doi:10.1371/journal.pbio.0000057. PMC 261894. PMID 14624257. Open access icon
  31. ^ Franklin, Jack (2003). "Open access to scientific and technical information: the state of the art". In Grüttemeier, Herbert; Mahon, Barry (eds.). Open access to scientific and technical information: state of the art and future trends. IOS Press. p. 74. ISBN 978-1-58603-377-4. Archived from the original on 7 August 2022. Retrieved 7 August 2022.
  32. ^ "DOI System and Internet Identifier Specifications". doi.org. 18 May 2010. Archived from the original on 26 June 2010. Retrieved 7 August 2010.
  33. ^ "DOI System and standard identifier registries". doi.org. Archived from the original on 26 June 2010. Retrieved 7 August 2010.
  34. ^ "The "doi" URI Scheme". ietf.org. Retrieved 14 October 2025.{{cite web}}: CS1 maint: url-status (link)
  35. ^ Van de Sompel, Herbert (13 November 2024). "The DOI URI Scheme: Utility or Branding?". WSDL. Retrieved 14 October 2025.{{cite web}}: CS1 maint: url-status (link)
  36. ^ International DOI Foundation (7 August 2014). "Resolution". DOI Handbook. Archived from the original on 31 March 2015. Retrieved 19 March 2015.
  37. ^ a b "DOAI". CAPSH (Committee for the Accessibility of Publications in Sciences and Humanities). Archived from the original on 25 August 2016. Retrieved 6 August 2016.
  38. ^ Schonfeld, Roger C. (3 March 2016). "Co-opting 'Official' Channels through Infrastructures for Openness". The Scholarly Kitchen. Archived from the original on 19 October 2016. Retrieved 17 October 2016.
  39. ^ a b Piwowar, Heather (25 October 2016). "Introducing oaDOI: resolve a DOI straight to OA". Archived from the original on 17 March 2017. Retrieved 17 March 2017.
  40. ^ "DOI System Tools". Archived from the original on 8 February 2017. Retrieved 7 February 2017.
  41. ^ "Chapter 7: The International DOI Foundation". DOI Handbook. doi.org. Archived from the original on 10 July 2015. Retrieved 8 July 2015.
  42. ^ "Multi-Primary Administrators". DONA Foundation. Archived from the original on 14 January 2017. Retrieved 7 February 2017.
  43. ^ "Digital object identifier (DOI) becomes an ISO standard". International Organization for Standardization. 10 May 2012. Archived from the original on 2 August 2012. Retrieved 10 May 2012.
  44. ^ "Standards and Specifications". Overviews & Standards. doi.org. 28 June 2010. Archived from the original on 26 June 2010. Retrieved 7 August 2010.
  45. ^ "Standards and Specifications: 1. ISO TC46/SC9 Standards". Overviews & Standards. doi.org. 18 November 2010. Archived from the original on 4 July 2011. Retrieved 3 July 2011.
  46. ^ RFC 4452
  47. ^ "About "info" URIs – Frequently Asked Questions". Info-uri.info. Archived from the original on 27 September 2010. Retrieved 7 August 2010.
  48. ^ "ANSI/NISO Z39.84-2005 Syntax for the Digital Object Identifier" (PDF). National Information Standards Organization. Archived (PDF) from the original on 25 June 2021. Retrieved 25 June 2021.
  49. ^ "Namespace Registration for Digital Object Identifier (DOI)". IANA.
[edit]