VehicleDetectionTracker is a personal Proof of Concept (POC) for real-time vehicle detection and tracking, powered by YOLO (You Only Look Once), built using Ultralytics. This project is an exploration into applying advanced object detection techniques for vehicle tracking, aiming to demonstrate the capabilities of YOLO for detecting and tracking vehicles in images and videos. ππ
- π Experiment with detecting vehicles in real-time or from pre-recorded videos.
- π Track vehicle positions and follow their movements across frames.
- π¨ Explore vehicle classification by brand and color, powered by MobileNet v3 (with Alibaba MNN backend).
- π This POC can serve as a foundation for applications like traffic analysis, surveillance, or automated vehicle monitoring.
- π Experiment with YOLO for precise object detection in dynamic environments.
This POC serves as a learning project to explore real-time vehicle tracking and detection using cutting-edge machine learning techniques. The VehicleDetectionTracker provides a simple framework to experiment with vehicle tracking, but itβs not intended to be a finished, production-ready product.
The project demonstrates how to leverage YOLO's powerful detection capabilities for various real-world scenarios, such as traffic monitoring or video analysis. While it is far from a complete system, the goal is to validate the approach and refine the application for future development.
One of the use cases explored in this POC is the integration with TrafficSentinel, a more complex project that uses the VehicleDetectionTracker for real-time traffic management and surveillance. While TrafficSentinel is intended as a more scalable, production-level solution, this POC serves as a testbed for concepts that might eventually be incorporated into future versions of the project.
π Explore the repository for more details, experiment with the code, and get started with your own vehicle detection and tracking tests! TrafficSentinel Architecture.
ππ I would like to extend my gratitude to the team at Spectrico for their car make and model recognition module. Thanks to their technology, my project has significantly improved in accuracy and efficiency in vehicle identification. You can find more information about their car make and model recognition classifier here.
VehicleDetectionTracker is an experimental project designed for advanced learning and research purposes in the field of computer vision. It leverages YOLO object detection and tracking technologies for vehicle detection and tracking, as well as integrates Car Make and Model classification and vehicle color recognition features, powered by Spectricoβs open-source tools.
The Car Make and Model classifier is based on YOLOv3 for object detection, and MobileNet for classification, allowing accurate identification of the make, model, and color of vehicles. This classifier works with YOLOv3 for vehicle detection and MobileNet for identifying vehicle makes and models. Please note that the project relies on Spectricoβs car recognition technology and their open-source repositories:
- Spectrico Car Make and Model Classifier YOLOv3 Python
- Car Make and Model Classifier YOLOv3 Python - Soheil Paper
- Spectrico Car Color Classifier YOLOv3 Python
This project is intended solely for educational exploration and is not intended for commercial or production-level use.
- Efficient vehicle detection and tracking in images and videos.
- Accurate vehicle speed estimation in kilometers per hour (KM/H).
- Precise determination of the direction of each detected vehicle.
- Vehicle color estimation.
- Easy integration into your computer vision projects.
Vehicle detection is a crucial component of many traffic management and surveillance systems. In this section, we'll explain how vehicle detection works using a simplified description.
- The process begins with a video input, typically captured from cameras placed in various locations, such as roads, highways, or parking lots.
- The video is composed of a sequence of frames (images). Vehicle detection works on each frame individually. These frames are extracted from the video stream.
- A specialized model, often based on YOLO (You Only Look Once) or similar deep learning frameworks, is used for vehicle detection. This model is pre-trained to recognize vehicles.
- Each frame is processed by the YOLO model, which analyzes the image and identifies objects present. In this case, it's looking for vehicles. The model detects vehicles by drawing bounding boxes around them and assigning a confidence score to each detection.
- To keep track of vehicles over time, the system uses a tracking mechanism. This ensures that the same vehicle is consistently recognized as it moves through the frames. Tracking may involve assigning unique IDs to vehicles and following their movements.
- The system calculates several attributes for each detected vehicle:
- Speed: By analyzing the position of the vehicle in successive frames and measuring the time difference, the system calculates the vehicle's speed.
- Direction: It determines the vehicle's direction of movement by comparing its position in different frames. This can be done by calculating the angle of motion.
- Reliability: The reliability of the calculated speed and direction is assessed based on the number of samples used. More samples often lead to higher reliability.
- Information about each detected vehicle, such as its type (car, truck, etc.), coordinates (position in the frame), speed, direction, and reliability, is collected and stored.
- Annotated frames are generated by adding visual overlays to the original frames. These overlays include tracking lines and bounding boxes around detected vehicles.
- The annotated frames can be displayed for real-time monitoring, analysis, or traffic management. They provide a visual representation of vehicle movements, and the collected data can be used for various purposes, including traffic flow analysis, incident detection, and more.
- The processed data, including vehicle attributes and annotated frames, can be stored, analyzed, or further processed according to the requirements of the application or system.
In summary, vehicle detection involves using a deep learning model to identify vehicles in video frames, tracking their movements, calculating attributes like speed and direction, and providing this information for display or analysis. This technology is a key component in traffic management, surveillance, and various other applications where vehicle monitoring is essential.
0: 384x640 1 car, 171.0ms
Speed: 5.0ms preprocess, 171.0ms inference, 2.0ms postprocess per image at shape (1, 3, 384, 640)
{
"number_of_vehicles_detected": 1, // Number of vehicles detected in this frame
"detected_vehicles": [ // List of information about detected vehicles
{
"vehicle_id": 5, // Unique identifier for the detected vehicle
"vehicle_type": "car", // Type of the detected vehicle (e.g., "car", "truck")
"detection_confidence": 0.4797525107860565, // Confidence score of the detection, indicating how accurate the model believes the detection is
"color_info": '[{"color": "Black", "prob": "0.2470433"}, {"color": "White", "prob": "0.21236196"}, {"color": "Blue", "prob": "0.15445676"}]', // Detected vehicle color information with respective probabilities
"model_info": '[{"make": "Jeep", "model": "Wrangler", "prob": "0.022544071"}, {"make": "Ford", "model": "Mustang", "prob": "0.014573687"}, {"make": "Honda", "model": "Civic", "prob": "0.010537574"}]', // Detected vehicle make and model information with respective probabilities
"speed_info": { // Information about the vehicle's speed
"kph": 68.1191847125589, // Calculated speed in kilometers per hour (kph)
"reliability": 1.0, // Reliability of the speed measurement
"direction_label": "Top Left", // Label indicating the direction of the vehicle (e.g., "Top Left" means moving towards the top left of the frame)
"direction": -2.2559461480857093 // Direction of movement in radians
},
"vehicle_coordinates": { // Coordinates and dimensions of the detected vehicle in the frame
"x": 123.45, // Horizontal position (x-axis) of the vehicle in the frame
"y": 678.90, // Vertical position (y-axis) of the vehicle in the frame
"width": 50.0, // Width of the bounding box around the vehicle
"height": 30.0 // Height of the bounding box around the vehicle
},
"vehicle_frame_base64": "...", // Base64 encoded image of the video frame where the vehicle was detected
"vehicle_frame_timestamp": "2024-08-10T15:30:45.123456Z" // Timestamp indicating the exact moment when the video frame was captured
}
]
}
You can quickly get started with VehicleDetectionTracker to detect and track vehicles in images and videos. Below are two usage examples, each tailored to different scenarios:
This example demonstrates how to use VehicleDetectionTracker to process a real-time video stream using OpenCV. Simply provide the URL of the video stream to get started:
from VehicleDetectionTracker.VehicleDetectionTracker import VehicleDetectionTracker
video_path = "[[YOUR_STREAMING_SOURCE]]"
vehicle_detection = VehicleDetectionTracker()
result_callback = lambda result: print({
"number_of_vehicles_detected": result["number_of_vehicles_detected"],
"detected_vehicles": [
{
"vehicle_id": vehicle["vehicle_id"],
"vehicle_type": vehicle["vehicle_type"],
"detection_confidence": vehicle["detection_confidence"],
"color_info": vehicle["color_info"],
"model_info": vehicle["model_info"],
"speed_info": vehicle["speed_info"]
}
for vehicle in result['detected_vehicles']
]
})
vehicle_detection.process_video(video_path, result_callback)
For more advanced use cases, VehicleDetectionTracker can also be integrated with Apache Kafka for processing video frame by frame:
from confluent_kafka import Consumer, KafkaError
from VehicleDetectionTracker.VehicleDetectionTracker import VehicleDetectionTracker
import json
conf = {
'bootstrap.servers': '192.168.1.39:9092"', # Configure this to your Kafka broker's address.
'group.id': 'my-group-1',
'auto.offset.reset': 'earliest'
}
topic = 'iot-camera-frames'
consumer = Consumer(conf)
consumer.subscribe([topic])
vehicleDetection = VehicleDetectionTracker()
while True:
msg = consumer.poll(1.0)
if msg is None:
continue
if msg.error():
if msg.error().code() == KafkaError._PARTITION_EOF:
print('End of partition, message read: {} [{}] at offset {}'.format(
msg.key(), msg.partition(), msg.offset()))
else:
print('Kafka error: {}'.format(msg.error()))
else:
# Process the message (a frame) received from Kafka
payload = json.loads(msg.value())
mac_address = payload.get('mac_address', '')
frame_timestamp = payload.get('frame_timestamp', '')
frame_data = payload.get('frame_data', '')
# Process the frame with the tracker
results = vehicleDetection.process_frame_base64(frame_data, frame_timestamp)
# Optionally, you can access the MAC address and timestamp for further processing
print(f"MAC Address: {mac_address}")
print(f"Timestamp: {frame_timestamp}")
result_callback = lambda result: print({
"number_of_vehicles_detected": result["number_of_vehicles_detected"],
"detected_vehicles": [
{
"vehicle_id": vehicle["vehicle_id"],
"vehicle_type": vehicle["vehicle_type"],
"detection_confidence": vehicle["detection_confidence"],
"color_info": vehicle["color_info"],
"model_info": vehicle["model_info"],
"speed_info": vehicle["speed_info"]
}
for vehicle in result['detected_vehicles']
]
})
result_callback(results)
consumer.close()
These examples showcase the flexibility of VehicleDetectionTracker and its ability to adapt to various real-world scenarios. Explore the repository's documentation and examples for more in-depth guidance.
Here are some screenshots that demonstrate the functionality of VehicleDetectionTracker:
Screenshot 1 | Screenshot 2 |
---|---|
Screenshot 3 | Screenshot 4 |
---|---|
Screenshot 5 | Screenshot 6 |
---|---|
Screenshot 7 | Screenshot 8 |
---|---|
Screenshot 9 | Screenshot 10 |
---|---|
Screenshot 11 | Screenshot 12 |
---|---|
VehicleDetectionTracker is an experimental project designed for advanced learning and research purposes in the field of computer vision. It leverages YOLO object detection and tracking technologies for vehicle detection and tracking, as well as integrates Car Make and Model classification and vehicle color recognition features, powered by Spectricoβs open-source tools.
The Car Make and Model classifier is based on YOLOv3 for object detection, and MobileNet for classification, allowing accurate identification of the make, model, and color of vehicles. This classifier works with YOLOv3 for vehicle detection and MobileNet for identifying vehicle makes and models. Please note that the project relies on Spectricoβs car recognition technology and their open-source repositories:
- Spectrico Car Make and Model Classifier YOLOv3 Python
- Car Make and Model Classifier YOLOv3 Python - Soheil Paper
- Spectrico Car Color Classifier YOLOv3 Python
This project is intended solely for educational exploration and is not intended for commercial or production-level use.
This project is licensed under the MIT License - see the LICENSE file for details.
- This package is powered by ultralytics YOLO for object detection.
- Special thanks to the open-source community for their contributions.
- I would like to extend my gratitude to the team at Spectrico for their car make and model recognition module. Thanks to their technology, my project has significantly improved in accuracy and efficiency in vehicle identification. You can find more information about their car make and model recognition classifier here.
Contributions to VehicleDetectionTracker are highly encouraged! If you're interested in adding new features, resolving bugs, or enhancing the project's functionality, please feel free to submit pull requests.
VehicleDetectionTracker is developed and maintained by Sergio SΓ‘nchez SΓ‘nchez (Dream Software). Special thanks to the open-source community and the contributors who have made this project possible. If you have any questions, feedback, or suggestions, feel free to reach out at [email protected].