💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
-
Updated
May 31, 2024 - Python
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Retrieval and Retrieval-augmented LLMs
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
A curated list of pretrained sentence and word embedding models
xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
SGPT: GPT Sentence Embeddings for Semantic Search
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
unified embedding model
Compute Sentence Embeddings Fast!
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
A Structured Self-attentive Sentence Embedding
A Python vector database you just need - no more, no less.
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
Add a description, image, and links to the sentence-embeddings topic page so that developers can more easily learn about it.
To associate your repository with the sentence-embeddings topic, visit your repo's landing page and select "manage topics."