Skip to content
View shigapov's full-sized avatar
Block or Report

Block or report shigapov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shigapov/README.md

🌱 Natural Language Processing

  • bbw is a tool for entity linking, entity typing and relation extraction using any Wikibase knowledge graph (e.g., Wikidata) and meta-lookup (metasearch over SearX) [docs], [paper]
  • spaCyOpenTapioca is a spaCy pipeline for named entity linker OpenTapioca [spaCy Universe], [docs]
  • Reichsanzeiger NLP is a NER/NEL corpus & annotation guidelines for the historical German newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819-1945)
  • blatt is a NLP-helper for OCR-ed pages in PAGE XML format (it's used to structure data from OCR-ed pages) [docs]
  • cas2iob is a converter of UIMA CAS XMI files exported from the INCEpTION annotation platform into IOB TSV files
  • madabi is a repo containing ongoing work on building Mannheim Data Bibliography for MADATA

🌱 Knowledge graphs

🌱 Research Data Management

🌱 NFDI

Pinned

  1. UB-Mannheim/bbw UB-Mannheim/bbw Public

    Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup

    Python 67 9

  2. UB-Mannheim/RaiseWikibase UB-Mannheim/RaiseWikibase Public

    Knowledge graph construction: Fast inserts into a Wikibase instance

    Python 44 7

  3. wikibase-knowledge-graphs wikibase-knowledge-graphs Public

    A collection of open source tools and resources related to Wikibase knowledge graphs

    60 3

  4. UB-Mannheim/spacyopentapioca UB-Mannheim/spacyopentapioca Public

    A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

    Python 90 8

  5. UB-Mannheim/reichsanzeiger-nlp UB-Mannheim/reichsanzeiger-nlp Public

    Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–1939)

    Shell 14 1

  6. UB-Mannheim/awesome-RDM UB-Mannheim/awesome-RDM Public

    A curated list of awesome RDM resources for researchers and organisations

    11 6