RealTimeOCR

RealTimeOCR is a computer vision project that combines YOLO (You Only Look Once) object detection with PaddleOCR optical character recognition (OCR) to locate and read text on objects in real-time video feeds. Typical applications include automated text extraction from documents, license plates, or other text-bearing objects in video.

Features

Real-time object detection using YOLO
Text extraction from detected objects using PaddleOCR
Customizable Region of Interest (ROI) for focused detection
Easy integration with video streams

Requirements

Make sure you have the following dependencies installed:

Python 3.7+
OpenCV
Pandas
NumPy
PaddleOCR
Ultralytics YOLO
CVZone

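You can install the necessary packages using pip (PaddleOCR also relies on the PaddlePaddle framework, which is installed separately as paddlepaddle if it is not already present):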

pip install opencv-python pandas numpy paddleocr ultralytics cvzone

Getting Started

Clone the repository:

git clone https://github.com/AmmarMohamed0/RealTimeOCR.git
cd RealTimeOCR

Download the YOLO weights:

Place your trained YOLO weights file, best.pt, in the project directory.

Prepare the class labels:

Create a coco.txt file in the project directory with the class labels (one per line).
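
As an illustration of the expected file format, here is a minimal sketch of reading such a file into a Python list. The helper name `load_class_labels` is hypothetical and is not taken from the project script; it only assumes one label per line.

```python
from pathlib import Path


def load_class_labels(path="coco.txt"):
    """Read class labels from a text file, one label per line."""
    text = Path(path).read_text(encoding="utf-8")
    # Strip surrounding whitespace and skip blank lines so that a
    # trailing newline or an empty line does not create an empty label.
    return [line.strip() for line in text.splitlines() if line.strip()]
```

Calling `load_class_labels("coco.txt")` returns the labels in file order. Since a detector reports classes as integer IDs, the order of the lines should match the class order the weights were trained with.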

Capture a video:

Place a sample video file named nr.mp4 in the project directory, or change the video path in the script to point to your own file.

Run the project:

python YOLO10_and_PaaddleOCR.py

How It Works

The video feed is captured using OpenCV.
The YOLO model predicts bounding boxes for objects in each video frame.
The detected objects' bounding boxes are checked against a defined polygonal Region of Interest (ROI).
If an object is detected within the ROI, it is cropped, resized, and processed using PaddleOCR to extract any text.
The detected text is displayed on the video frame.
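
The ROI check in the steps above can be sketched as follows. This is an illustration under stated assumptions, not the project's actual code: it assumes a box counts as "within the ROI" when its centre lies inside the polygon, and the names `point_in_polygon` and `boxes_in_roi`, the ROI coordinates, and the sample boxes are all hypothetical. In an OpenCV pipeline `cv2.pointPolygonTest` would typically do this job; a pure-Python ray-casting test is used here instead so the sketch runs without any dependencies.

```python
def point_in_polygon(point, polygon):
    """Ray-casting test: True if point lies inside polygon.

    Points exactly on the boundary are not treated specially.
    """
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            # x coordinate where the edge crosses that horizontal line
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def boxes_in_roi(boxes, roi):
    """Keep boxes (x1, y1, x2, y2) whose centre falls inside the ROI polygon."""
    kept = []
    for x1, y1, x2, y2 in boxes:
        centre = ((x1 + x2) / 2, (y1 + y2) / 2)
        if point_in_polygon(centre, roi):
            kept.append((x1, y1, x2, y2))
    return kept


# Hypothetical ROI polygon and detections, in pixel coordinates.
roi = [(100, 100), (500, 100), (500, 400), (100, 400)]
boxes = [(150, 150, 250, 200), (600, 50, 700, 120)]
print(boxes_in_roi(boxes, roi))  # prints [(150, 150, 250, 200)]
```

Only the boxes that pass this filter would then be cropped from the frame and handed to the OCR step, which keeps the comparatively slow text recognition off objects outside the area of interest.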

License

This project is licensed under the MIT License - see the LICENSE file for details.
