Skip to content

It's Smart-Question Answering System on short as well as long documents. It can automatically find answers to matching questions directly from documents. The deep learning language model converts the questions and documents to semantic vectors to find the matching answer.

License

Notifications You must be signed in to change notification settings

SatyamSoni23/Smart-Question-Answering-System-on-Document

Repository files navigation

Smart-Question-Answering-System-on-Document

It's Smart-Question Answering System on short as well as long documents. It can automatically find answers to matching questions directly from documents. The deep learning language model converts the questions and documents to semantic vectors to find the matching answer.

Approches:

  • Question Answering System Using Simple Split and Cosine Similarity (Naive Approach)
  • Question Answering System Using Word2Vec Embedding Technique
  • Question Answering System Using Glove Embedding Technique
  • Question Answering System with Fine-Tuned BERT Technique
  • Question Answering System Using CDQA on Multiple Pdf Files

Challenges

  • Bert is a really powerful model for tackling a question-answering problem. However, it comes up with the limitation of 512 tokens and the documents were really longer than 512 tokens. In order to handle this limitation I wrote the function "expand_split_sentences", which split and expand sentences i.e., it makes paragraphs with lesser than 512 tokens and makes data frames of that paragraph. In this, more than one data frame contains the correct answer so we will find the best answer by finding the max start score.

Pretrained Model and Dataset Used

  • word2vec
  • glove
  • bert-large-uncased-whole-word-masking-finetuned-squad
  • bert-squad_1.1

Reference

Deployment

  • Method Selection

alt text

  • File Uploading

alt text

  • Ask Question

alt text

  • Processing Answer Using BERT

alt text

  • Processing Answer Using Fine-tuning BERT

alt text

About

It's Smart-Question Answering System on short as well as long documents. It can automatically find answers to matching questions directly from documents. The deep learning language model converts the questions and documents to semantic vectors to find the matching answer.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages