MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
-
Updated
Nov 24, 2023 - Python
MC^2: A Multilingual Corpus of Minority Languages in China (Tibetan, Uyghur, Kazakh, and Mongolian)
Fine-tune LLM for early Middle English lemmatization with data from LAEME.
This repository highlights the LLMs reasoning capabilities of ✨ Mistral / LLaMA-3 / Phi-3 / Gemma / Flan-T5 / GPT-4o ✨ in Targeted Sentiment Analysis in Russian / Translated to English mass-media 📊
Unsupervised Contextualized Document Representation, to appear in SustaiNLP 2021 EMNLP 2021
Must-read papers on relation extraction.
Wen Lai's Blog related to MT/NLP/ML
Awesome Lao Natural Language Processing
Crowdsource Platform for Low Resourced Language Annotation and Corpus Contribution
This repository provides HAWP: a dataset for Hindi Word Problem Solving and a baseline (LREC 2022)
English-Sinhala multilingual word embedding alignment resources
Teaching Large Language Models an Unseen Language on the Fly
Code for AACL23 paper "Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish"
Pashto Natural Language Processing Toolkit
Hausa Natural Language Processing Repository
This is the official repository contains the code, data, and models of the paper titled "Shironaam: Bengali News Headline Generation using Auxiliary Information", accepted for publication in Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL’23), May 2-6, 2023.
This is an official Leaderboard for the RuSentRel-1.1 dataset originally described in paper (arxiv:1808.08932)
Official implementation of the EACL Findings 2024 paper: Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction
AAAI Knowledge NLP Submission
An overview of the possibilities of using TARS models for low language resources
Children StoryBooks for 180 langauges.
Add a description, image, and links to the low-resource-nlp topic page so that developers can more easily learn about it.
To associate your repository with the low-resource-nlp topic, visit your repo's landing page and select "manage topics."