About

AI and Machine Learnings use in detecting pulmonary disorders such as Tuberculosis, pneumonia and many others has been a well-known fact for quite a while now. When — Coronavirus disease 2019 also known as COVID-19 was declared as a pandemic in 2020 those similar techniques could be used for the diagnosis of COVID-19, especially in developing countries where there is lack of specialized physicians, and test kits are in scarce supply. These inadequacies are leading to the misdiagnosis of the different pulmonary disorders, more specifically COVID-19. The objective of this project is to provide an overview on the use of Deep neural networks, both Vanilla neural networks and other pretrained models to present a quick solution to provide a classification between COVID-19, Bacterial Pneumonia, viral Pneumonia, Tuberculosis, and Normal/Healthy images.

It is critical to detect the positive cases as early as possible so as to prevent the further spread of this pandemic and to quickly treat affected patients. The need for auxiliary diagnostic tools has increased as there are no accurate automated toolkits available.

Recent findings obtained using radiology imaging techniques suggest that such images contain salient information about the COVID-19 virus. Application of advanced artificial intelligence (AI) techniques coupled with radiological imaging can be helpful for the accurate detection of this disease, and can also be assistive to overcome the problem of a lack of specialized physicians in remote villages.

Objective

Given X-ray images of patients, to build a machine learning model that will analyze and detect if the patient has COVID-19 or not and also classify between 3 other pulmonary Disorders Tuberculosis, Bacterial, and Viral Pneumonia.

This may not be clinically viable

Usage

Option 1: Docker

This is a containerized flask application with docker image put on docker hub. If azure url is not working Pull docker image

docker pull 0941924816/covid-detection-or-analysis:latest

Run docker image

docker run --rm -it  -p 8000:8000/tcp 0941924816/covid-detection-or-analysis:latest

Option 2:(no docker)

If you do not have docker installed make sure you have python3 and pip3 installed

1. Clone the repo

git clone https://github.com/Azariagmt/pulmonary-disorder-detection-using-x-ray-images.git

2. cd into repo

cd pulmonary-disorder-detection-using-x-ray-images

3.Install required dependencies:

pip3 install requirements.txt

4.Run Flask application

flask run

Datasets

The datasets utilized in this project are obtained from three publicly available sources. The first one is the Covid-19 Chest X-ray Database (Rahman et al.) which in its current version consists of 3616 COVID-19 positive cases along with 10,192 Normal, 6012 Lung Opacity (Non-COVID lung infection), and 1345 Viral Pneumonia images. The second one is the Chest X-Ray Images (Pneumonia) dataset (Mooney) consisting of 5,863 X-Ray images split into 2 categories (Pneumonia/Normal). The third dataset used is The Tuberculosis (TB) Chest X-ray Database (Rahman #) which contains CXR images of Normal (3500) and patients with TB (3500).

Repository overview

Structure
    ├── models (fetched from google drive due to large size)
    ├── modules	
    │   ├── load_models.py (fetches models from drive)
    │   ├── predict.py
    ├── notebooks	
    │   ├── Data Fetching and preprocessing
    │   |   ├── Get datasets.ipynb (Gets the datasets from sources specified in Datasets)
    │   |   ├── Data preprocessing.ipynb (Gets the data ready for training)
    |   └── Training (training notebooks of different neural network architectures)
    │       ├── DenseNet201.ipynb
    │       ├── InceptionResNetV2.ipynb
    │       ├── InceptionV3.ipynb
    │       ├── NasNetLarge.ipynb
    │       ├── ResNet101V2.ipynb
    │       ├── ResNet50V2.ipynb
    │       ├── VGG19.ipynb
    │       ├── Xception.ipynb
    ├── numpy arrays (Stored .npy and .npz files used for training and testing)
    ├── static
    ├── templates
    ├── app.py (flask application initialized here)
    └── Dockerfile

Results

Transfer learning was used in the building of the models, some reaching an accuracy of 98.4% on the dataset collected from the different sources above.

classification report

Model	Accuracy	Precision	Recall	F1-Score	Support
InceptionV3	97.37%	97%	97%	97%	4404
InceptionResNetV2	95.5%	97%	97%	97%	300
DenseNet121	98.27%	98%	98%	98%	4404
DenseNet169	98.07%	98%	98%	98%	4404
DenseNet201	98.27%	98%	98%	98%	4404
ResNet50V2	95%	95%	95%	95%	4404
ResNet101V2	97.12%	97%	97%	97%	4404
VGG16	97.39%	96%	96%	96%	4404
VGG19	97.48%	97%	97%	97%	4404
Xception	-	95%	95%	95%	4404

Confusion matrix for chosen models

These are the confusion matrices of tsome of the pretrained models used to build the models.

Xception

DenseNet201

ResNet50V2

Conclusion

This project has several limitations that can be overcome in future research. In particular, a more in-depth analysis requires much more patient data, especially those suffering from COVID-19(currently) and more research for the specific pulmonary disorder we're trying to predict in an area. A more interesting approach for future research would focus on distinguishing patients showing mild symptoms, and those that will actually need intubation or not. while these symptoms may not be accurately visualized on X-rays, or may not be visualized at all.

Furthermore, we will try to use our approach on bigger datasets, to solve other medical problems like cancer, tumors, etc. and also on other computer vision fields as energy, agriculture, and transport in the upcoming days. Future research directions will include the exploration of image data augmentation techniques to improve accuracy even more while avoiding overfitting. We observed that performance could be improved further, by increasing dataset size, using a data augmentation approach, and using hand-crafted features, in the future. More models will be trained and the different architectures of the pretrained models will also be analyzed further!

Name		Name	Last commit message	Last commit date
Latest commit History 176 Commits
.github/workflows		.github/workflows
.vscode		.vscode
html/pulmonary-disorder-detection-using-x-ray-images		html/pulmonary-disorder-detection-using-x-ray-images
models		models
modules		modules
notebooks		notebooks
numpy arrays		numpy arrays
static		static
templates		templates
upload		upload
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
app.py		app.py
docker-compose.debug.yml		docker-compose.debug.yml
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
gunicorn_config.py		gunicorn_config.py
requirements.txt		requirements.txt
sal.py		sal.py
wsgi.py		wsgi.py

License

Azariagmt/pulmonary-disorder-detection-using-x-ray-images

Folders and files

Latest commit

History

Repository files navigation

About

Objective

Usage

Option 1: Docker

Option 2:(no docker)

Datasets

Repository overview

Results

classification report

Confusion matrix for chosen models

Xception

DenseNet201

ResNet50V2

Conclusion

About

Topics

Resources

License

Stars

Watchers

Forks

Languages