{"id":1186,"date":"2016-09-27T16:33:45","date_gmt":"2016-09-27T21:33:45","guid":{"rendered":"http:\/\/blogs.ams.org\/beyondreviews\/?p=1186"},"modified":"2018-03-14T08:29:11","modified_gmt":"2018-03-14T13:29:11","slug":"metadata","status":"publish","type":"post","link":"https:\/\/blogs.ams.org\/beyondreviews\/2016\/09\/27\/metadata\/","title":{"rendered":"Metadata"},"content":{"rendered":"<p>MathSciNet is full of metadata. \u00a0We create our own metadata. \u00a0We receive metadata from many of the publishers of the journals we cover. \u00a0So what are\u00a0metadata? \u00a0(Or what <a href=\"https:\/\/xkcd.com\/1429\/\"><em>is<\/em><\/a>\u00a0metadata?) \u00a0The simplest explanation of metadata is that they are\u00a0a type of data that describes other data. \u00a0The classical example is the metadata found in card catalogs from libraries.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1993\" src=\"http:\/\/blogs.ams.org\/beyondreviews\/files\/2018\/03\/cardcat.jpg\" width=\"1280\" height=\"719\" alt=\"A card from a card catalog, with annottions\" srcset=\"https:\/\/blogs.ams.org\/beyondreviews\/files\/2018\/03\/cardcat.jpg 1280w, https:\/\/blogs.ams.org\/beyondreviews\/files\/2018\/03\/cardcat-300x169.jpg 300w, https:\/\/blogs.ams.org\/beyondreviews\/files\/2018\/03\/cardcat-768x431.jpg 768w, https:\/\/blogs.ams.org\/beyondreviews\/files\/2018\/03\/cardcat-1024x575.jpg 1024w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Lots of information is on the card. \u00a0Note that before the annotation, nothing is labeled. There are accepted rules that tell a librarian (or a patron) what each piece of data is. \u00a0For most pieces of this data, a non-librarian would be likely to figure out what everything meant. <!--more--><\/p>\n<p>In an online catalog (such as from the Library of Congress), you might see:<\/p>\n<table>\n<tbody>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Personal name:<\/strong><\/td>\n<td>Malinowski, Bronislaw, 1884-1942.<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Title:<\/strong><\/td>\n<td>Magic, science and religion and other essays<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Published\/Created<\/strong>:<\/td>\n<td>Boston, Beacon Press, 1948.<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>LCCN Permalink:\u00a0<\/strong><\/td>\n<td><a href=\"https:\/\/lccn.loc.gov\/48006987\">https:\/\/lccn.loc.gov\/48006987<\/a><\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Description<\/strong>:<\/td>\n<td>xii, 327 p. front. 22 cm.<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>LC classification (full)<\/strong>:<\/td>\n<td>GN8 .M286<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>LC classification (partial)<\/strong>:<\/td>\n<td>GN8<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Related names<\/strong>:<\/td>\n<td>Redfield, Robert, 1897-ed.<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Contents<\/strong>:<\/td>\n<td>Magic, science and religion.&#8211;Myth in primitive psychology.&#8211;Baloma: the spirits of the dead in the Trobriand Islands.&#8211;The problem of meaning in primitive language.&#8211;An anthropological analysis of war.<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Subjects<\/strong>:<\/td>\n<td>Anthropology.<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Notes:<\/strong><\/td>\n<td>Bibliographic footnotes.<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>LCCN:<\/strong><\/td>\n<td>48006987<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Dewey class no.<\/strong><\/td>\n<td>572.04<\/td>\n<\/tr>\n<tr valign=\"top\">\n<td align=\"right\"><strong>Type of material:<\/strong><\/td>\n<td>Book<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Notice that now the data fields are all labeled.<\/p>\n<p>However, in modern libraries, there is more information than (normally) meets the eye. Here is a more detailed view of the digital record, as provided by the <a href=\"https:\/\/www.loc.gov\/marc\/umb\/um01to06.html\">MARC<\/a> record:<br \/>\n<code><br \/>\n000 01142cam a22002891 4500<br \/>\n001 8352402<br \/>\n005 20050721194408.0<br \/>\n008 750617s1948 mau b 000 0 eng<br \/>\n035 __ |9 (DLC) 48006987<br \/>\n906 __ |a 7 |b cbc |c oclcrpl |d u |e ncip |f 19 |g y-gencatlg<br \/>\n010 __ |a 48006987<br \/>\n035 __ |a (OCoLC)1395532<br \/>\n040 __ |a DLC |c FTS |d OCoLC |d DLC<br \/>\n042 __ |a premarc<br \/>\n050 00 |a GN8 |b .M286<br \/>\n082 __ |a 572.04<br \/>\n100 1_ |a Malinowski, Bronislaw, |d 1884-1942.<br \/>\n245 10 |a Magic, science and religion, and other essays; |c selected and with an introd. by Robert Redfield.<br \/>\n260 __ |a Boston, |b Beacon Press, |c 1948.<br \/>\n300 __ |a xii, 327 p. |b front. |c 22 cm.<br \/>\n504 __ |a Bibliographical footnotes.<br \/>\n505 0_ |a Magic, science and religion.--Myth in primitive psychology.--Baloma: the spirits of the dead in the Trobriand Islands.--The problem of meaning in primitive language.--An anthropological analysis of war.<br \/>\n650 _0 |a Anthropology.<br \/>\n700 1_ |a Redfield, Robert, |d 1897- |e ed.<br \/>\n985 __ |e OCLC REPLACEMENT cdsdistr<br \/>\n991 __ |b c-GenColl |h GN8 |i .M286 |t Copy 1 |w OCLCREP<br \/>\n991 __ |b c-GenColl |h GN8 |i .M286 |p 00017792015 |t Copy 2 |w CCF<br \/>\n<\/code><\/p>\n<p>Wow! Lots more information! Moreover, I can&#8217;t understand most of it. \u00a0It is all labeled, but using a secret code. \u00a0How is this helpful? \u00a0Well, MARC stands for &#8220;<span style=\"text-decoration: underline\">MA<\/span>chine <span style=\"text-decoration: underline\">R<\/span>eadable <span style=\"text-decoration: underline\">C<\/span>ataloging&#8221;. \u00a0So I&#8217;m not supposed to be able to understand this, but a computer parses the information easily. \u00a0This is an example of metadata for the digital world. \u00a0(The\u00a0machine code is available <a href=\"https:\/\/lccn.loc.gov\/48006987\/marcxml\">here<\/a>\u00a0as an XML file, in case you want to be unable to read it in another format.)<\/p>\n<h2>Metadata at Mathematical Reviews<\/h2>\n<p>In the right hands, metadata are pieces of\u00a0information that is developed, structured, and maintained to describe materials in ways that meet the particular needs of a group of users. At Mathematical Reviews, metadata for each bibliographic entry in MathSciNet are created to serve the research needs of mathematicians, librarians, and others who work with the mathematics literature. Metadata describe each item listed in the database in terms of its type of publication, creators and publisher, length, edition, online availability, subject area (using <a href=\"http:\/\/www.ams.org\/mathscinet\/msc\/msc2010.html\">MSCs<\/a>), and other identifying characteristics.<\/p>\n<p>MathSciNet metadata provide consistent and well-structured information about what has been published in mathematics over time and how publications are related to each other based on elements such as authors, publication date, subject matter, references, and editions. Staff members at Mathematical Reviews work to assure the consistency and accuracy of the metadata created for roughly\u00a0120,000 items each year. High quality metadata are essential to the many features of MathSciNet including <a href=\"http:\/\/www.ams.org\/mathscinet\/searchauthors.html\">author<\/a> and <a href=\"http:\/\/www.ams.org\/mathscinet\/search.html\">publications<\/a> searching, the <a href=\"http:\/\/www.ams.org\/mathscinet\/citations.html\">Citation Database<\/a>, <a href=\"http:\/\/www.ams.org\/mathscinet\/MRAuthorID\/42675\">Author Profile pages<\/a>, and the rich and growing set of links within MathSciNet that enable efficient and accurate\u00a0exploration of the mathematical literature.<\/p>\n<p>The creation of bibliographic metadata at Mathematical Reviews takes place at two levels. The first level involves the bibliographic description of books, journals and issues. \u00a0At this level, the work resembles the cataloging at a library. \u00a0The second level focuses on the description of individual papers within journal issues and book collections. \u00a0At this second level, the work can be quite different, working in finer\u00a0detail with a narrower range of data. \u00a0 Cataloging principles and standards guide the bibliographic description of materials at Mathematical Reviews. One important set of principles is\u00a0found in the <a href=\"http:\/\/www.rda-rsc.org\/content\/rda_faq#1\">Resource Description and Access<\/a> (RDA) standards. \u00a0(See also <a href=\"https:\/\/www.loc.gov\/aba\/rda\/index.html\">this page<\/a> from the\u00a0Library of Congress.) These and other principles and standards\u00a0must be\u00a0interpreted and incorporated into a cataloging framework designed to meet the information needs of the mathematical community. \u00a0Adhering to principles and standards allows us to maintain continuity and consistency in MathSciNet across all the literature we cover,\u00a0and across the wider bibliographic world (e.g., your library).<\/p>\n<h3>Preliminary Data<\/h3>\n<p>Publishers create metadata for their publications. \u00a0One of the earliest resources was Bowker&#8217;s\u00a0<em><a href=\"http:\/\/www.bowker.com\/products\/Books-In--Print.html\">Books in Print<\/a><\/em>. \u00a0(See also <a href=\"https:\/\/en.wikipedia.org\/wiki\/R.R._Bowker\">this<\/a>.) Years ago, this was produced annually as a bound volume that you could find in your library or bookstore. \u00a0It attempted to list\u00a0the bibliographic information of every book printed in a given year. \u00a0It was a book about books &#8211; metadata! \u00a0 For scholarly and academic journals, there was Ulrich&#8217;s\u00a0<em><a href=\"http:\/\/www.ulrichsweb.com\/ulrichsweb\/faqs.asp\">Periodicals Directory<\/a><\/em>. \u00a0 Bowker&#8217;s and Ulrich&#8217;s only had the information from the publishers who gave it to them. Both resources were widely used in libraries, for purchasing as well as for identification and awareness purposes. Bowker\u2019s was also heavily used by bookstores and book distributors, particularly for making purchases. Needless to say, it was a good idea for publishers to provide data to Bowker&#8217;s and to Ulrich&#8217;s. \u00a0Both are now solely online.<\/p>\n<p>For books, Bowker&#8217;s established a format for the electronic delivery of metadata. \u00a0Data in this format was frequently used by both online bookstores and brick-and-mortar bookstores. \u00a0Other formats exist, such as <a href=\"https:\/\/kdp.amazon.com\/help?topicId=A294SHSUYLKTA6\">what amazon.com uses<\/a>. \u00a0The Library of Congress records described above are important. \u00a0When readying\u00a0a book, the publisher applies for &#8220;<a href=\"https:\/\/www.loc.gov\/publish\/cip\/\">CIP data<\/a>&#8221; &#8211; cataloging in publishing data. \u00a0The record returned to the publisher from the Library of Congress becomes an important component of the metadata attached to the publication.<\/p>\n<p>Mathematical Reviews receives\u00a0metadata from publishers through our\u00a0Preliminary Data program. Publisher metadata received in this program are used to create preliminary entries in MathSciNet, while editorial decisions, cataloging, and classification are completed. These items are marked in MathSciNet with the icon \u00a0<a href=\"http:\/\/blogs.ams.org\/beyondreviews\/files\/2016\/09\/Screen-Shot-prelim.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-1303\" src=\"http:\/\/blogs.ams.org\/beyondreviews\/files\/2016\/09\/Screen-Shot-prelim.png\" alt=\"screen-shot-prelim\" width=\"91\" height=\"30\" \/><\/a>. \u00a0Preliminary MathSciNet entries speed up\u00a0the availability of information about publications by several weeks.<\/p>\n<p>While preliminary data accelerate the posting of items to MathSciNet, it is\u00a0<span style=\"text-decoration: underline\">not<\/span> the case that the data arrive and we can just blithely post away! \u00a0Data arrive in all sorts of formats. \u00a0And with various extras. \u00a0Upon arrival, preliminary data go through an initial check, during which materials such as cover images, front and back matter, and other non-article information are removed. Duplicate materials are also removed and issue and journal level information is verified and corrected as needed. The papers are then sent to our editors, who select those for which permanent listings in MathSciNet will be made.<\/p>\n<p>The selected papers move through the departments at Mathematical Reviews and the bibliographic data are checked, edited, and enhanced at each point. Discrepancies between the data received and the published online version are addressed and additional information about the paper may be added. <a href=\"http:\/\/blogs.ams.org\/beyondreviews\/2015\/09\/14\/who-wrote-that\/\">Author disambiguation<\/a> is done, links to <a href=\"http:\/\/www.ams.org\/mathscinet\/MRAuthorID\/189017\">Author Profile pages<\/a> are made, <a href=\"http:\/\/www.ams.org\/instcode\">institutional codes<\/a> are added, and <a href=\"http:\/\/www.ams.org\/mathscinet\/msc\/msc2010.html\">classifications<\/a> assigned. When these steps are completed, the preliminary entry is replaced with its permanent and complete MathSciNet entry. Articles from approximately 800 journals now arrive through our Preliminary Data program. We continue to add journals to the program and look forward to working with additional publishers in the future.<\/p>\n<p>Accurate, timely, and consistent metadata are integral to the information and services we provide to the mathematical community. MathSciNet metadata development is an ongoing process accomplished by experienced and dedicated staff throughout Mathematical Reviews.<\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>I am very grateful to Kathy Wolcott, \u00a0Librarian and Manager of the Mathematical Reviews\u00a0Acquisitions Department, for help with this post. \u00a0Large swathes of this post are <del>plagiarized<\/del> verbatim from a document written by her.<\/p>\n<hr \/>\n<p>Is &#8220;data&#8221; singular or plural? \u00a0For that question, I defer to the <a href=\"https:\/\/xkcd.com\/1429\/\">wisdom of xkcd<\/a>.<\/p>\n<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>","protected":false},"excerpt":{"rendered":"<p>MathSciNet is full of metadata. \u00a0We create our own metadata. \u00a0We receive metadata from many of the publishers of the journals we cover. \u00a0So what are\u00a0metadata? \u00a0(Or what is\u00a0metadata?) \u00a0The simplest explanation of metadata is that they are\u00a0a type of &hellip; <a href=\"https:\/\/blogs.ams.org\/beyondreviews\/2016\/09\/27\/metadata\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" data-url=https:\/\/blogs.ams.org\/beyondreviews\/2016\/09\/27\/metadata\/><\/div>\n","protected":false},"author":86,"featured_media":1303,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":true,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[6],"tags":[],"class_list":["post-1186","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-general-information"],"jetpack_featured_media_url":"https:\/\/blogs.ams.org\/beyondreviews\/files\/2016\/09\/Screen-Shot-prelim.png","jetpack_shortlink":"https:\/\/wp.me\/s6C2KK-metadata","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/posts\/1186","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/users\/86"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/comments?post=1186"}],"version-history":[{"count":52,"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/posts\/1186\/revisions"}],"predecessor-version":[{"id":1994,"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/posts\/1186\/revisions\/1994"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/media\/1303"}],"wp:attachment":[{"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/media?parent=1186"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/categories?post=1186"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.ams.org\/beyondreviews\/wp-json\/wp\/v2\/tags?post=1186"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}