# sentence

Here are 157 public repositories matching this topic...

Tokenization separates a piece of text into smaller units called tokens, which can be words, characters, or subwords. It can therefore be broadly classified into three types: word, character, and subword (character n-gram) tokenization.

  • Updated Jun 30, 2021
  • Jupyter Notebook
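
The three tokenization types named in the description above can be illustrated with a minimal sketch. This is not taken from the repository itself: the function names `word_tokens`, `char_tokens`, and `char_ngrams` are hypothetical, the word splitter is a simple regex rather than a trained tokenizer, and fixed-size character n-grams stand in for subword units.

```python
import re


def word_tokens(text):
    # Word-level: runs of letters, digits, or underscores; punctuation is dropped.
    return re.findall(r"\w+", text)


def char_tokens(text):
    # Character-level: every character (including spaces) is its own token.
    return list(text)


def char_ngrams(word, n=3):
    # Subword-level (assumed here to mean character n-grams):
    # slide a window of width n across the word.
    if len(word) <= n:
        return [word]
    return [word[i:i + n] for i in range(len(word) - n + 1)]


print(word_tokens("Tokens matter"))  # ['Tokens', 'matter']
print(char_tokens("cat"))            # ['c', 'a', 't']
print(char_ngrams("token", 3))       # ['tok', 'oke', 'ken']
```

Practical subword tokenizers usually learn their vocabulary from data; the sliding window here only shows the idea of units smaller than a word and larger than a character.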

This repository contains the implementation of a fine-tuned Llama2 chatbot using QLoRA, tailored to provide detailed information and recommendations about movies. The model is fine-tuned on the IMDB dataset, enabling it to generate informed and contextually relevant responses.

  • Updated May 19, 2024
  • Jupyter Notebook

This package hosts an array of frequently used regex validations and regular-expression evaluation utilities. In general, the string checks cover last-word, middle-word, and first-word checks, sentence validation, phone number validation, name validation with or without honorific, password with both default parameter settings and cus…

  • Updated Apr 6, 2018
  • JavaScript
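
A few of the checks listed in the description above (first word, last word, sentence validation) might look roughly like the following sketch. It does not reproduce the package's API: the package is written in JavaScript, and the names `first_word_is`, `last_word_is`, and `looks_like_sentence`, as well as the rule used for a valid sentence, are assumptions made for illustration.

```python
import re

# Hypothetical patterns; the package's own expressions are not shown in the source.
FIRST_WORD = re.compile(r"^\s*(\w+)")   # first run of word characters
LAST_WORD = re.compile(r"(\w+)\W*$")    # last run of word characters before the end


def first_word_is(sentence, word):
    # Case-insensitive comparison of the first word.
    match = FIRST_WORD.search(sentence)
    return match is not None and match.group(1).lower() == word.lower()


def last_word_is(sentence, word):
    # Trailing punctuation and whitespace are ignored.
    match = LAST_WORD.search(sentence)
    return match is not None and match.group(1).lower() == word.lower()


def looks_like_sentence(text):
    # Assumed rule: starts with a capital letter and ends with '.', '!' or '?'.
    return re.fullmatch(r"[A-Z].*[.!?]", text) is not None


print(first_word_is("The quick fox jumps.", "the"))   # True
print(last_word_is("The quick fox jumps.", "jumps"))  # True
print(looks_like_sentence("hello world"))             # False
```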
