From my office window I have an unobstructed side view of Ring I (Kehä I) in Espoo, Finland. It is one of the busiest roads in Finland, carrying up to 100,000 cars per day. I wanted to create a program that would receive a video feed from a webcam and process the images in real time on common hardware.
Object tracking is already a fairly well-studied problem, but I wanted to take advantage of the special nature of this one. Namely, the cars are known to move in a purely horizontal direction, and I didn't want to write complex background learning and separation code. I also didn't need to know the cars' locations precisely; the main interest was in the number of cars and their velocities.
The final output of the algorithm is visualized in Figure 1. Video frames are converted to grayscale and high-pass filtered (the result can be seen in Figure 3). Adjacent frames are then subtracted from each other to detect image locations whose brightness changed significantly. These points are used to generate a Delaunay triangulation. Overly large triangles are removed from the mesh, and each resulting disconnected sub-graph would ideally cover a single car.
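The original implementation is Matlab, but the segmentation step can be sketched in Python with NumPy/SciPy roughly as follows. The thresholds (`diff_thresh`, `sigma`, `max_edge`) are illustrative placeholders, not the values used in the actual system:

```python
import numpy as np
from scipy.ndimage import gaussian_filter
from scipy.spatial import Delaunay

def change_points(prev_gray, curr_gray, diff_thresh=30.0, sigma=2.0):
    """High-pass filter both grayscale frames, difference them, and return
    (x, y) coordinates of pixels whose brightness changed significantly."""
    hp_prev = prev_gray - gaussian_filter(prev_gray, sigma)
    hp_curr = curr_gray - gaussian_filter(curr_gray, sigma)
    diff = np.abs(hp_curr - hp_prev)
    ys, xs = np.nonzero(diff > diff_thresh)
    return np.column_stack([xs, ys]).astype(float)

def segment_regions(points, max_edge=20.0):
    """Delaunay-triangulate the change points, drop edges longer than
    max_edge, and return the connected components as sets of point indices.
    Each component ideally covers a single car."""
    tri = Delaunay(points)
    adj = {i: set() for i in range(len(points))}
    for simplex in tri.simplices:
        for a, b in [(0, 1), (1, 2), (0, 2)]:
            i, j = simplex[a], simplex[b]
            if np.linalg.norm(points[i] - points[j]) <= max_edge:
                adj[i].add(j)
                adj[j].add(i)
    # Flood-fill the remaining graph into disconnected sub-graphs.
    seen, components = set(), []
    for start in range(len(points)):
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            node = stack.pop()
            if node in comp:
                continue
            comp.add(node)
            stack.extend(adj[node] - comp)
        seen |= comp
        components.append(comp)
    return components
```

Removing the long edges is what splits the mesh apart: two well-separated cars end up in different sub-graphs because only a few long triangles ever bridged them.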
This process can merge cars together if the distance between them is too small, as seen in Figure 2. Luckily this problem can be mitigated in data post-processing via standard statistical methods and tracker stability analysis. Additionally, overly large regions can be ignored altogether. Points and graphs are plotted in blue in Figure 1, and red boxes bound separate "regions of interest" (ROIs). These are then analyzed individually.
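Turning the point sub-graphs into bounding boxes and discarding the implausibly large ones could look like this sketch (the `max_area` and `pad` values are hypothetical, not from the original code):

```python
import numpy as np

def rois_from_components(points, components, max_area=8000.0, pad=5.0):
    """Bound each point sub-graph with a padded box (x0, y0, x1, y1) and
    discard boxes whose area is too large for a single car, since those
    are likely several merged vehicles."""
    rois = []
    for comp in components:
        pts = points[list(comp)]
        x0, y0 = pts.min(axis=0) - pad
        x1, y1 = pts.max(axis=0) + pad
        if (x1 - x0) * (y1 - y0) <= max_area:
            rois.append((x0, y0, x1, y1))
    return rois
```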
The region-of-interest analysis steps are shown in Figure 3. Once a ROI's boundaries are determined, the corresponding region is extracted from the current and previous video frames. The previously generated images (converted to grayscale and high-pass filtered) are re-used. The translation between these frames is then determined efficiently, accurately, and robustly by applying the phase correlation method in 1D: the Fourier transform is calculated for each row of both images, one spectrum is multiplied element-wise by the complex conjugate of the other (and normalized to unit magnitude), and the inverse Fourier transform is calculated. The outcome of this is shown on the right side of Figure 3.
To determine the translation between the images, the row-wise phase correlations are multiplied together and the maximum of the resulting signal is located. The location of the maximum directly gives the amount and direction of the translation. This signal is shown in Figure 4, and it has a single, very distinct peak. This method doesn't rely on any interest point extraction, it isn't fooled by partially repetitive patterns, and no voting scheme is needed for a robust outcome. Interest points were only used to detect and segment individual cars, not to actually track them.
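The two steps above can be sketched in NumPy like this. It follows the textbook phase correlation formulation; details such as the epsilon guard are my own additions, not necessarily how the Matlab code handles them:

```python
import numpy as np

def horizontal_shift(roi_prev, roi_curr):
    """Estimate the horizontal translation between two equally sized
    patches via row-wise 1-D phase correlation.

    Each row is Fourier-transformed, the spectra are combined as
    F_curr * conj(F_prev) and normalized to unit magnitude, and the
    inverse transform yields one correlation signal per row.  The
    row-wise signals are multiplied together, and the index of the
    peak gives the shift (wrapped to a signed value)."""
    f_prev = np.fft.fft(roi_prev, axis=1)
    f_curr = np.fft.fft(roi_curr, axis=1)
    cross = f_curr * np.conj(f_prev)
    cross /= np.abs(cross) + 1e-12           # keep phase information only
    corr = np.real(np.fft.ifft(cross, axis=1))
    combined = np.prod(corr, axis=0)         # multiply row-wise correlations
    peak = int(np.argmax(combined))
    n = roi_prev.shape[1]
    # Indices past the midpoint correspond to negative (leftward) shifts.
    return peak if peak <= n // 2 else peak - n
```

Because each row's correlation is close to 1 only at the true shift and near 0 elsewhere, the product across rows suppresses spurious peaks very aggressively, which is why a single distinct peak survives.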
Overall the system would benefit from a better background model and background separation algorithm, but this simple code worked surprisingly well, and it is only about 200 lines of Matlab. It isn't 100% accurate, but the most common errors can be detected and corrected by analyzing the output across multiple frames to find outliers and incorrect detections. If traffic gets heavy and car speeds drop to zero, this method sees an empty road, since there is no movement. This issue, too, would be fixed by better background separation code.
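One simple instance of that multi-frame cleanup is a rolling-median filter over the per-frame shift estimates; this is just one plausible sketch of the idea, with the window size and deviation threshold chosen arbitrarily:

```python
import numpy as np

def clean_shifts(shifts, window=5, max_dev=3.0):
    """Replace per-frame shift estimates that deviate strongly from the
    local median with the median itself (simple outlier correction
    across frames)."""
    shifts = np.asarray(shifts, dtype=float)
    cleaned = shifts.copy()
    half = window // 2
    for i in range(len(shifts)):
        lo, hi = max(0, i - half), min(len(shifts), i + half + 1)
        med = np.median(shifts[lo:hi])
        if abs(shifts[i] - med) > max_dev:
            cleaned[i] = med
    return cleaned
```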