Niko's Project Corner

This is an alternative answer to a question I encountered on Stack Overflow about fuzzy searching of hashes in Elasticsearch. My original answer used locality-sensitive hashing. This version gains superior speed and a simpler implementation by using NVIDIA's CUDA via the Thrust library.

Actually, this project needed no more than a few dozen lines of code to solve the problem; most of the code is for benchmarking with varying problem sizes. The Hamming distance was calculated as the Hamming weight of the XOR of the two 64-bit integers, using the "popcount_3" algorithm from the Wikipedia article on Hamming weight. Interestingly, it was more efficient to sum the weights of the two 32-bit halves than to calculate the weight of all 64 bits at once. It would take more benchmarking and profiling to find out why this was the case.
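To make the two variants concrete, here is a minimal sketch in plain C++ (not the original CUDA code). It assumes the multiply-based bit-counting trick that the Wikipedia article describes; the function names `popcount64`, `popcount32`, `hamming_full`, `hamming_halves` and `check_variants_agree` are made up for this illustration.

```cpp
#include <cassert>
#include <cstdint>

// Multiply-based bit count for a 64-bit word.
constexpr int popcount64(std::uint64_t x) {
    x -= (x >> 1) & 0x5555555555555555ULL;                                // counts per 2 bits
    x = (x & 0x3333333333333333ULL) + ((x >> 2) & 0x3333333333333333ULL); // counts per 4 bits
    x = (x + (x >> 4)) & 0x0f0f0f0f0f0f0f0fULL;                           // counts per byte
    return static_cast<int>((x * 0x0101010101010101ULL) >> 56);           // sum of the 8 bytes
}

// The same steps for a 32-bit word.
constexpr int popcount32(std::uint32_t x) {
    x -= (x >> 1) & 0x55555555u;
    x = (x & 0x33333333u) + ((x >> 2) & 0x33333333u);
    x = (x + (x >> 4)) & 0x0f0f0f0fu;
    return static_cast<int>((x * 0x01010101u) >> 24);                     // sum of the 4 bytes
}

// Hamming distance = number of set bits in the XOR of the two hashes.
constexpr int hamming_full(std::uint64_t a, std::uint64_t b) {
    return popcount64(a ^ b);
}

// Variant that sums the weights of the two 32-bit halves of the XOR.
constexpr int hamming_halves(std::uint64_t a, std::uint64_t b) {
    const std::uint64_t diff = a ^ b;
    return popcount32(static_cast<std::uint32_t>(diff)) +
           popcount32(static_cast<std::uint32_t>(diff >> 32));
}

// Sanity check: both variants must agree on any pair of hashes.
inline void check_variants_agree(std::uint64_t a, std::uint64_t b) {
    assert(hamming_full(a, b) == hamming_halves(a, b));
}
```

For example, the hashes 0xB and 0x2 differ in two bits, so both variants return 2 for that pair. Which variant is faster on a given GPU or CPU is something only a benchmark can tell.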

Anyway, my laptop's GeForce GT 650M was able to scan through 700 million 64-bit hashes per second. This was far superior to the performance that I got from Elasticsearch, which needs 100 milliseconds to search through 1 million documents, whereas CUDA takes only 2.5 milliseconds! The difference grows when more documents are searched: for 7 million images it took 6 - 7 seconds for ES but only 10 milliseconds for CUDA. This solution also gives 100% accuracy, since every hash is compared against the query.
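The exhaustive scan itself is simple. Below is a CPU-only reference sketch in C++, not the Thrust code that was benchmarked; the function name `scan_hashes` and its parameters are assumptions made for this illustration. It compares every stored hash against the query and keeps the indices within a given Hamming distance, which is why no match can be missed.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: returns the indices of all hashes whose Hamming
// distance to `query` is at most `max_distance` bits.
std::vector<std::size_t> scan_hashes(const std::vector<std::uint64_t>& hashes,
                                     std::uint64_t query,
                                     int max_distance) {
    assert(max_distance >= 0 && max_distance <= 64);
    std::vector<std::size_t> matches;
    for (std::size_t i = 0; i < hashes.size(); ++i) {
        // Count the differing bits between the stored hash and the query.
        const int distance =
            static_cast<int>(std::bitset<64>(hashes[i] ^ query).count());
        if (distance <= max_distance) {
            matches.push_back(i);
        }
    }
    return matches;
}
```

On a GPU each comparison is independent of the others, so the loop body maps naturally to one thread per hash.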

Kernel launch incurs some overhead, but its share of the total time diminishes when searching more than 5 million images.

Figure 1: A search performance of 700 million images per second was reached.

The CUDA solution could be made HTTP-accessible by, for example, implementing a FastCGI interface to the script and having an Nginx web server route requests to it. This is very easy to set up, especially if the image hashes don't change often and fit in a single machine's GPU memory.

Very fuzzy searching with CUDA