{"id":1609,"date":"2018-02-15T15:31:01","date_gmt":"2018-02-15T15:31:01","guid":{"rendered":"http:\/\/www.hirmeos.eu\/?p=1609"},"modified":"2018-03-15T17:12:58","modified_gmt":"2018-03-15T17:12:58","slug":"hirmeos-enhances-its-digital-platforms-with-entity-fishing-validation-of-the-nerd-services","status":"publish","type":"post","link":"https:\/\/www.hirmeos.eu\/2018\/02\/15\/hirmeos-enhances-its-digital-platforms-with-entity-fishing-validation-of-the-nerd-services\/","title":{"rendered":"HIRMEOS enhances its digital platforms with entity-fishing: Validation of the NERD Services."},"content":{"rendered":"

Validation of the NERD Services<\/strong><\/p>\n

The platforms involved in the project have completed the integration of entity-fishing (NERD \u2013 Named Entities Recognition and Disambiguation) services. Several applications have been successfully tested on the platforms and will be developed further over the coming months. Below are a few brief notes on the work done. To find out more, join the webinar ENHANCING PUBLISHING PLATFORMS: ENTITIES EXTRACTION FOR OPEN ACCESS MONOGRAPHS, Monday, 05.03.2018, 14:00 \u2013 15:00 (CET) (Click here to register<\/a>).<\/p>\n

OpenEdition<\/h3>\n

OpenEdition has integrated the entity-fishing (NERD) system into Core (the central application for managing its publications). The data extracted from its monographs (a few dozen documents) were stored in a database and indexed in Solr. The process can be described in the following steps:<\/p>\n

    \n
  • the documents are processed, using their HTML representation to segment paragraphs and sentences;<\/li>\n
  • entities classified with the named entity class PERSON or LOCATION and a frequency above two are then collected in a dedicated store;<\/li>\n
  • all other entities are further processed using the NERD_KID service in order to predict their named entity class on the basis of their disambiguated Wikidata entry;<\/li>\n
  • all entities from each book are then aggregated, and only those of type LOCATION and PERSON are indexed.<\/li>\n<\/ul>\n
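The selection logic in the steps above can be sketched in Python. This is only one reading of the description, not OpenEdition's actual code: the mention format, the function names and the lookup table standing in for the NERD_KID service are all assumptions made for illustration, and the Wikidata identifiers are arbitrary examples.

```python
from collections import Counter

KEPT_CLASSES = {'PERSON', 'LOCATION'}
MIN_FREQUENCY = 3  # the text says 'frequency above two'

# Hypothetical stand-in for the NERD_KID service: a fixed lookup table.
FAKE_NERD_KID = {'Q142': 'LOCATION'}


def predict_class(wikidata_id):
    '''Return a predicted class for a Wikidata id, or None if unknown.'''
    return FAKE_NERD_KID.get(wikidata_id)


def select_entities(mentions):
    '''Return {(wikidata_id, class): frequency} for the entities to index.

    Each mention is a dict with a 'wikidata_id' key and an optional 'type'.
    '''
    typed, others = Counter(), Counter()
    for mention in mentions:
        wikidata_id = mention['wikidata_id']
        entity_class = mention.get('type')
        if entity_class in KEPT_CLASSES:
            typed[(wikidata_id, entity_class)] += 1
        else:
            others[wikidata_id] += 1

    # Directly classified entities must pass the frequency threshold.
    selected = {key: n for key, n in typed.items() if n >= MIN_FREQUENCY}

    # Everything else gets a predicted class; only PERSON/LOCATION is kept.
    for wikidata_id, n in others.items():
        predicted = predict_class(wikidata_id)
        if predicted in KEPT_CLASSES:
            key = (wikidata_id, predicted)
            selected[key] = selected.get(key, 0) + n
    return selected
```

Whether the frequency threshold should also apply to entities whose class was predicted is not stated in the description; the sketch leaves them unfiltered.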

     <\/p>\n


    G\u00f6ttingen University Press<\/h3>\n

    G\u00f6ttingen State and University Library has integrated the entity-fishing (NERD) service into the publishing workflow of G\u00f6ttingen University Press (GUP) to enable semi-automatic indexing of its monographs.<\/p>\n

    Titles, abstracts and metadata of the monographs published by GUP are processed by entity-fishing (NERD). Entities classified into the named entity types PERSON, LOCATION and PERIOD are collected and shown on the book\u2019s page on the GUP website, allowing users to quickly find the monographs in which these entities appear.<\/p>\n

    The entity-fishing (NERD) service has been integrated into the workflow used to add new books to the library. In particular, when processing the metadata of a monograph, GUP editors now have at their disposal an on-the-fly call to entity-fishing (NERD) that autocompletes the form.<\/p>\n

    On the book or collection page, the entities are then displayed in word clouds, so that users can immediately see which topics are addressed most frequently in the collection. Clicking on a facet takes the user directly to the titles in which that entity occurs.<\/p>\n
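As an illustration of how such facets and word-cloud weights could be derived, here is a minimal Python sketch. It assumes the extracted entities are already available as a mapping from book title to entity labels; the function names and the choice of weight (number of titles per entity) are hypothetical and not taken from the GUP implementation.

```python
from collections import defaultdict


def build_facets(books):
    '''Map each entity label to the sorted list of titles mentioning it.

    books: dict of title -> iterable of entity labels found in that title.
    '''
    facets = defaultdict(set)
    for title, entities in books.items():
        for label in entities:
            facets[label].add(title)
    return {label: sorted(titles) for label, titles in facets.items()}


def cloud_weights(facets):
    '''Weight of an entity in the word cloud: number of titles it occurs in.'''
    return {label: len(titles) for label, titles in facets.items()}
```

A click on a facet then amounts to looking up the entity label in the result of `build_facets`.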


    EKT \/ National Documentation Center<\/h3>\n

    EKT \/ National Documentation Center is using Open Monograph Press | Public Knowledge Project software as its e-publishing infrastructure. Open Monograph Press (OMP) is an open source software platform for managing the editorial workflow required to see monographs, edited volumes, and scholarly editions through internal and external review, editing, cataloguing, production, and publication.<\/p>\n

    EKT has extended OMP with entity-fishing support by integrating the service API into the OMP monographs\u2019 landing page in order to annotate the abstract with the NERD entities:<\/p>\n

      \n
    • Colorize the abstract of the monograph based on the recognized entities and their types, and provide extra information about each entity from the knowledge base;<\/li>\n
    • Index all recognized entities and support facet browsing, or even tag cloud browsing, to simplify search and discovery of OMP content.<\/li>\n<\/ul>\n
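The colorizing step can be sketched as follows. This is a minimal Python illustration, assuming each recognized entity comes with character offsets and a type; the key names `start`, `end` and `type` and the CSS class prefix `entity-` are invented for this example and are not the OMP plugin's actual names.

```python
from html import escape


def colorize(text, entities):
    '''Wrap each entity span in a <span> whose CSS class encodes its type.

    entities: dicts with 'start' and 'end' character offsets and a 'type'.
    Overlapping spans are skipped, which is a simplification.
    '''
    parts = []
    cursor = 0
    for ent in sorted(entities, key=lambda e: e['start']):
        if ent['start'] < cursor:
            continue  # overlaps a span that was already emitted
        parts.append(escape(text[cursor:ent['start']]))
        css_class = 'entity-' + ent['type'].lower()
        surface = escape(text[ent['start']:ent['end']])
        parts.append('<span class="{}">{}</span>'.format(css_class, surface))
        cursor = ent['end']
    parts.append(escape(text[cursor:]))
    return ''.join(parts)
```

A stylesheet can then assign one colour per class, for example one for `entity-person` and another for `entity-location`.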


      Ubiquity Press<\/h3>\n

      Ubiquity Press has developed an internal service (called Archiver) which receives notifications from the existing company platform when a new article has been published (using a pub-sub architecture), and POSTs its content (after formatting it as JSON as per the NERD service specifications) to the entity-fishing (NERD) API in order to retrieve all the entities and store them locally.<\/p>\n

      This internal service exposes an API to the existing UP journal frontend, where the entities are shown to the reader as clickable links referring to the Wikipedia entry for the entity; the links live in a contextual section for each article, and are easily accessible while reading the content.<\/p>\n
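The two ends of this flow, building the JSON query and turning the returned entities into reader-facing links, might look roughly like the Python sketch below. It is not the Archiver code. The field names `text`, `language`, `lang`, `entities`, `rawName` and `wikipediaExternalRef`, and the use of a Wikipedia page id in a `curid` URL, are assumptions that should be checked against the entity-fishing API documentation; the HTTP call itself is deliberately left out.

```python
import json


def build_query(text, lang='en'):
    '''Serialize an article text as a JSON query for the NERD service.'''
    return json.dumps({'text': text, 'language': {'lang': lang}})


def entity_links(response, lang='en'):
    '''Turn the entities of a parsed response into (label, URL) pairs.'''
    links = []
    for ent in response.get('entities', []):
        page_id = ent.get('wikipediaExternalRef')
        if page_id is None:
            continue  # recognized, but not linked to a Wikipedia page
        url = 'https://{}.wikipedia.org/?curid={}'.format(lang, page_id)
        links.append((ent['rawName'], url))
    return links
```

Keeping both functions free of network access makes them easy to test against stored responses before wiring them to the live API.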


      OAPEN<\/h3>\n

      The entity-fishing (NERD) API will be integrated into the workflow for processing new publications entered by OAPEN. The integration will also be applied to existing English and German titles in the OAPEN Library. The result will be a publicly available data source containing all named entities from over 2000 titles.<\/p>\n

       <\/p>\n

       <\/p>\n

       <\/p>\n","protected":false},"excerpt":{"rendered":"

      Validation of the NERD Services The platforms involved in the project have completed the integration of entity-fishing (NERD \u2013 Named Entities Recognition and Disambiguation) services. Several applications have been successfully tested on the platforms and over the coming months they<\/p>\n","protected":false},"author":11,"featured_media":1621,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[68],"tags":[],"_links":{"self":[{"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/posts\/1609"}],"collection":[{"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/users\/11"}],"replies":[{"embeddable":true,"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/comments?post=1609"}],"version-history":[{"count":4,"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/posts\/1609\/revisions"}],"predecessor-version":[{"id":1620,"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/posts\/1609\/revisions\/1620"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/media\/1621"}],"wp:attachment":[{"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/media?parent=1609"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/categories?post=1609"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.hirmeos.eu\/wp-json\/wp\/v2\/tags?post=1609"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}