Skip to content

Using NLP and RNN to build a clinical decision support model, taking input of structured medical record text (SOAP style records)

Notifications You must be signed in to change notification settings

yeahshow/word2vec_medical_record

Repository files navigation

word2vec_medical_record

This project aim to utilize neural network to analyze sequence input from structured free-text medical records. At this stage, models are trained to give categorical output and optimized with categorical loss. In addition, a word embedding can be generated during training.

word2vec model

Antoine Bordes, et al. "Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing." AISTATS(2012) https://www.hds.utc.fr/~bordesan/dokuwiki/lib/exe/fetch.php?id=en%3Apubli&cache=cache&media=en:bordes12aistats.pdf

Alexis Conneau, et al. "Very Deep Convolutional Networks for Natural Language Processing." (2016) https://arxiv.org/abs/1606.01781

Preprocessing the original data

The original data was stored in .xls document, extracted from a medical record database. We prepare our training data with the following procedures:

  • Remove any training data with missing item
  • Remove non-english characters
  • Remove all next-line character '\r\n'
  • Assign an UUID to each set of training data.

About

Using NLP and RNN to build a clinical decision support model, taking input of structured medical record text (SOAP style records)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published