Advanced Medical AI Analysis & Prescription Platform

Developed By:

Pratheek Tiruangari
Ashruj Gautam

Project Overview

A cutting-edge medical diagnostic system leveraging state-of-the-art deep learning models and Large Language Models (LLM) for comprehensive patient analysis. This revolutionary platform combines medical imaging, natural language processing, and structured data analysis to deliver comprehensive medical assessments and tailored prescription recommendations.

Core Components

1. Medical Image Analysis Engine (CheXpert Transfer Learning)

Base Model: DenseNet-121 pretrained on ImageNet
Transfer Learning Process:
1. Initial Training on Stanford's CheXpert dataset (224,316 chest radiographs)
2. Domain adaptation using custom loss functions
3. Final transfer learning on the target CheXpert small X-ray dataset
Model Evaluation:
- After Transfer Learning:
  - Accuracy: 89.5%
  - F1-Score: 0.87
  - AUC-ROC: 0.92
  - Precision: 0.88
  - Recall: 0.86

2. Advanced NLP System (Clinical BERT Transfer Learning)

Base Model: BioBERT pretrained on PubMed abstracts
Transfer Learning Pipeline:
1. Initial pretraining on 2M medical documents
2. Domain adaptation on 100K radiology reports
3. Final transfer learning on the custom dataset
Architecture:
- 12 transformer layers
- 768 hidden dimensions
- 12 attention heads
- Medical vocabulary expansion: +18,000 tokens
Performance Evaluation:
- After Transfer Learning:
  - Accuracy: 67.1%
  - F1-Score: 0.87
  - Precision: 0.84
  - Recall: 0.91

3. Multi-Disease Classification System (Trained From Scratch)

Ensemble of specialized models for critical disease detection, each trained on carefully curated datasets:

a. Diabetes Prediction Model

Architecture: Custom Gradient Boosting Classifier
Dataset: BRFSS2015 Diabetes Dataset (253,680 samples)
Feature Engineering:
- 16 vital health markers optimization
- Advanced correlation analysis
- Custom feature scaling pipeline
Model Performance:
- Accuracy: 75.2%
- Precision: 0.73
- Recall: 0.79
- F1-Score: 0.76
- AUC-ROC: 0.83

b. Kidney Disease Analytics

Architecture: Enhanced Random Forest with Custom Preprocessing
Dataset: Chronic Kidney Disease Dataset (400 samples)
Feature Processing:
- 24 biological markers
- Missing value imputation
- Advanced feature selection
Model Performance:
- Accuracy: 97.5%
- Precision: 1.0
- Recall: 0.93
- F1-Score: 0.96
- AUC-ROC: 1.0

c. Heart Disease Detection

Architecture: Deep Neural Network
- Input Layer: 13 nodes
- Hidden Layers: [256, 128, 64] nodes
- Activation: ReLU + Batch Normalization
- Dropout: 0.3
Dataset: Cleveland Heart Disease Dataset (303 samples)
Model Performance:
- Accuracy: 88.5%
- Precision: 0.83
- Recall: 0.92
- F1-Score: 0.88
- AUC-ROC: 0.97

4. LLM-Powered Diagnostic Fusion & Prescription System

Model: Locally-hosted Llama2 LLM
Integration: Custom prompt engineering for medical context
Capabilities:
- Multi-modal result fusion
- Contextual prescription generation
- Drug interaction analysis
- Patient-specific medication adjustments
Features:
- Considers comorbidities in prescriptions
- Real-time medication compatibility checks
- Dosage optimization based on patient conditions

5. Interactive GUI System

Modern PyQt5-based interface
Drag-and-drop functionality for X-rays
Real-time analysis feedback
Integrated report generation
Export capabilities for medical records

Project Structure

├── Image/          # Image analysis models and scripts
├── Input/          # Sample input data
├── Scripts/        # Core Python processing scripts  
├── Table/          # Tabular data models
│   ├── Diabetes/
│   ├── Heart_Disease/
│   └── Kidney/
└── Text/           # Text analysis models

Setup & Installation

Create a Python virtual environment:

python -m venv venv
.\venv\Scripts\Activate.ps1

Install dependencies:

pip install -r requirements.txt

Run the application:

python gui.py

Note: Your system is expected to have Llama 3.2 installed locally.
If not, please download it here.

Development Guidelines

Security:
- Never commit credentials or tokens
- Use only relative paths
- Validate all user inputs
- Properly handle errors
- Follow least privilege principle
Data Protection:
- Use anonymized test data only
- Keep models and checkpoints local
- Clean output files of metadata
Code Quality:
- Follow PEP 8 style guide
- Add proper documentation
- Include error handling
- Write unit tests

Technical Architecture

Deep Learning Infrastructure

Framework: PyTorch with CUDA acceleration
Computing: GPU-optimized for real-time inference
Memory Management: Efficient batch processing
Parallelization: Multi-threaded data processing

1. Core Pipeline Architecture

The primary logic is implemented in run_all_models.py via the MedicalAIPipeline class. It integrates multiple AI models to perform comprehensive patient analysis and treatment recommendations.

A. Multi-Modal Input Processing

Image Analysis

Inputs chest X-ray images
Uses a DenseNet121 model with attention mechanisms
Performs multi-label classification for 13 lung diseases
Implementation: train_chexagent_model.py and related files

Text Analysis

Inputs radiology reports
Uses a BERT-based model fine-tuned on clinical text
Performs multi-label classification matching image model's outputs
Includes a fallback keyword-matching system

Tabular Data Analysis

Diabetes: XGBoost model on 21 features
Kidney Disease: Random Forest on 24 features
Heart Disease: MLP classifier using clinical inputs
All provide binary classification (Yes/No) with probability outputs

B. Model Integration

Disease Detection Fusion

Merges predictions from image and text models
Resolves conflicts using confidence scores

Risk Assessment

Combines heart, kidney, and diabetes results
Builds a complete comorbidity risk profile

C. LLM Integration

Takes merged predictions from all models
Generates structured medical prompts
Uses a local LLaMA model for recommendations
Considers comorbidities and contraindications
Implements fallback systems (Ollama, GPT4All, LM Studio)

2. LLM Fusion Strategy

The LLM layer acts as a high-level decision-making system that:

Aggregates Outputs

Integrates predictions from image, text, and tabular models
Utilizes confidence scores to balance conflicting results

Contextual Reasoning

Analyzes relations between multiple diseases
Evaluates risks posed by comorbidities like diabetes, heart, and kidney disease

Medical Knowledge Application

Applies clinical logic to suggest compatible treatments
Highlights drug contraindications based on comorbidity status
Adjusts or recommends alternative medications accordingly

Final Decision Support

Prioritizes treatment based on severity
Balances multiple therapies to avoid medical conflict
Produces a holistic treatment recommendation plan

Diagramatic Representation

Usage Example

sdf.mp4

Implementation Highlights

Advanced Model Training

Transfer Learning Pipeline:
- Pre-trained on large medical datasets
- Fine-tuned on specialized conditions
- Custom loss functions for medical context

Innovative LLM Integration

Custom Medical Knowledge Base:
- Drug interaction database
- Disease comorbidity patterns
- Treatment protocols
Context-Aware Processing:
- Patient history consideration
- Medication contradiction prevention
- Dynamic dosage adjustment

Scalable Architecture

Modular design for easy updates
Parallel processing capabilities
CPU/GPU flexible deployment
Containerization support

Future Enhancements

Integration with Electronic Health Records
Mobile application development
Cloud deployment options
Additional disease modules
Enhanced prescription automation

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
Image		Image
Input		Input
Scripts		Scripts
Table		Table
Text		Text
data		data
README.md		README.md
gui.py		gui.py
medical_analysis_results.txt		medical_analysis_results.txt
requirements.txt		requirements.txt
run_all_models.py		run_all_models.py

Pratheek-Tirunagari-and-Ashruj-Gautam/Multi-Modal-Health-Insights-Platform

Folders and files

Latest commit

History

Repository files navigation