

Nginx docker image for easy file access via HTTP

Description: An alternative for SSHFS, Samba shares etc.
Languages: Bash
Tags: Docker, Spark, Nginx, GitHub
Duration: Spring 2016
Modified: 16th April 2016
GitHub: nikonyrh/docker-scripts
DockerHub: nikonyrh/nginx_bridge

Often I find myself having an SSH connection to a remote server from which I'd like to retrieve some files to my own machine. Common methods for this include a Windows/Samba share, SSHFS and an upload to the cloud (which isn't trivial to do via plain cURL). Here an easy-to-use alternative is described: a single-line command loads and runs a Docker image which contains a pre-configured Nginx instance. The files can then be accessed via plain HTTP at the user-assigned port (assuming a firewall isn't blocking it).

I found that writing Dockerfiles is way easier than, for example, writing Makefiles, maybe because their operations closely match those you'd execute via Bash anyway when setting up a new box. Additionally there are convenient published images to base your own images on, thus minimizing the number of custom steps you need to think of and implement.

The implemented Docker image is based on "FROM nginx:1.9", and just contains custom nginx.conf and main.sh files. When docker run -p 1234:80 -v "$PWD:/volume" -d nikonyrh/nginx_bridge is executed it starts the container, mounts the current working directory to the /volume path (which could also be mounted read-only) and exposes its contents as an auto-indexed Nginx folder at http://localhost:1234. By default the access log is available at http://localhost:1234/logs/logx.txt, but logging can be disabled with the -no-log flag at start-up. The image is about 190 MB, gzip compresses it down to 72 MB and Docker Hub reports it as 75 MB.
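A minimal usage sketch based on the commands described above (the fetched file name example.jpg is hypothetical, and the -no-log flag is assumed to be passed as an argument after the image name, to be handled by the container's main.sh start-up script):

    # Serve the current working directory (here read-only) at port 1234.
    docker run -p 1234:80 -v "$PWD:/volume:ro" -d nikonyrh/nginx_bridge

    # Fetch a file over plain HTTP; example.jpg is a hypothetical name.
    curl -O http://localhost:1234/example.jpg

    # The access log itself is served over HTTP as well.
    curl http://localhost:1234/logs/logx.txt

    # Start without access logging; the -no-log flag is assumed to be
    # consumed by main.sh when the container starts.
    docker run -p 1234:80 -v "$PWD:/volume" -d nikonyrh/nginx_bridge -no-log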

Some efficiency experiments were run as well. A few gigabytes of JPG images (40 to 400 kB in size) were transferred, and at best 90% of the 1 Gbps bandwidth was achieved. Files were transferred from an Ubuntu server, through a router, to a Windows machine running Ubuntu in VirtualBox, via cURL to /dev/null. The resulting bandwidth is shown in Figure 1. Parallel execution was achieved via xargs; the overhead of the three-way TCP handshake was significant unless each cURL process fetched multiple images.
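As a sketch, the parallel transfer could be scripted along these lines (the file listing files.txt and the server address 192.168.1.10 are hypothetical; each cURL process gets a batch of URLs so it can reuse a single TCP connection):

    # files.txt holds one image name per line (a hypothetical listing).
    # Run 8 cURL processes in parallel, 100 URLs per invocation, so each
    # process amortizes the three-way handshake over many transfers.
    sed 's|^|http://192.168.1.10:1234/|' files.txt \
      | xargs -n 100 -P 8 curl -s > /dev/null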

Figure 1: Achieved bandwidth when transferring medium-size files over 1 Gb Ethernet and HTTP.

In conclusion, this seems to be a viable way of distributing JPG images to other machines within the LAN for further processing. The first task might be calculating color histograms of webcam images on different calendar dates and times of day; the distribution of that computation will be handled by the Spark framework. It would also be interesting to compare this performance to alternatives such as HDFS.


Related blog posts:

AnalyticsPlatform
ServiceDiscovery
WebcamMon
BenchmarkTaxiridesEsSql
InternalNetwork