Skip to main content

Importing OSM (PBF file) to PostGIS (PGSNAPSHOT schema)

OSM Data can be used for variety of usecases like rendering, searching, data analytics etc. Each usecase yield quickest result in a particular data format. Due to the large volume of data, this can cause significant difference in the response times. 

Some of these formats are,
  1. PBF -- A highly compressed format used almost exclusively for storage and downloads owing to the small sizes. These format are not human readable and gzip compressed.
  2. PostgreSQL with PostGIS extension -- A data format backed up by PostgreSQL database. There are various schemas supported for multiple use cases which can greatly impact the response time as indexes and normalization are used to speed up queries. Some popular schemas along with their usecase and certain features are as below. The table is taken directly from openstreetmap wiki.

Schema nameCreated withUsed byPrimary use caseUpdatableGeometries (PostGIS)Losslesshstore columnsDatabase
osm2pgsqlosm2pgsqlMapnikKothic JSRenderingyesyesnooptionalPostgreSQL
apidbosmosisAPIMirroringyesnoyesnoPostgreSQL, MySQL
pgsnapshotosmosisjXAPIAnalysisyesoptionalyesyesPostgreSQL
imposmImposmRenderingnoyesnoImposm2: no, Imposm3: yesPostgreSQL
nominatimosm2pgsqlNominatimSearch, Geocodingyesyesyes?PostgreSQL
ogr2ogrogr2ogrAnalysisnoyesnooptionalvarious
osmsharpOsmSharpRoutingyesno??Oracle
overpassOverpass APIAnalysisyes?yes?custom
mongosmMongOSMAnalysismaybe???MongoDB
node-mongosmMongoosejsAnalysisyesyesyesNAMongoDB
goosmgoosmAnalysisnoyesyesNAMongoDB
pbf2mongopbf2mongoAnalysisnoyesyesNAMongoDB
waychangeSQLstreetkeysmvData cache and Analysisonly a schemaoptional?noPostgreSQL
In this blog, we'll download a PBF file containing our region of interest and insert the data into a PostgreSQL database as PGSNAPSHOT schema.

Get PBF Data

Visit https://download.geofabrik.de/, there you'll find OSM data for continent and countries (and cities in some cases). 

Install osmosis

Visit https://github.com/openstreetmap/osmosis/releases/latest, and download the gzipped tarball containing osmosis binaries.

# mkdir osmosis
# mv osmosis-latest.tgz osmosis
# cd osmosis
# tar xvfz osmosis-latest.tgz
# rm osmosis-latest.tgz

Create Dataqbase and import PGSNAPSHOT schema

# createdb gis
# psql -d gis -c "CREATE extension IF NOT EXISTS postgis; CREATE extension IF NOT EXISTS hstore"
# psql -d gis -f osmosis/script/pgsnapshot_schema_0.6.sql

At a later stage, these scripts can be run for specific queries,

# psql -d pgsnapshot -f osmosis_dir/script/pgsnapshot_schema_0.6_action.sql 

# psql -d pgsnapshot -f osmosis_dir/script/pgsnapshot_schema_0.6_bbox.sql 

# psql -d pgsnapshot -f osmosis_dir/script/pgsnapshot_schema_0.6_linestring.sql

Import PBF data

osmosis/bin/osmosis \
--read-pbf file="asia.osm.pbf" \
--write-pgsql host="localhost" database="gis" user="" password=""

Try out GeoQuery

select st_distance(geom, 'SRID=4326;POINT(-74.1235 35.3521)'::geometry) as dist, \
tags->'name' as name from nodes where tags?'place' and tags->'place' \
in ('city','muncipality','town') order by dist;

Comments

Popular posts from this blog

Multimaster replication with Symmetric DS

Symmetric DS is an awesome tool for trigger based replication whcih works for all major database vendors, including but not limited to PostgreSQL, MySQL, MSSQL, Oracle and many others. Symmetric-DS is a java application and can execute on any platform on whcih JRE is available including Windows and Linux. Trigger based replication, in constrast to disk based (eg. DRBD ) or transaction log file shipping based or statement based , works by registering triggers on DMLs and sending the data thus generated to remote machines. Another very popular trigger based DB replication tool is Slony . Symmetric-DS in addition to being database agnostic also supports multi-master replication (MMR). MMR usecase involves multiple database nodes, connected in a pool with DML updates coming from any of them. This is different from the normal master-slave replication, where slaves are not expected to generate any data events, and the sole authority of database is the master. MMR requirement causes d...

Motorola XT502 Custom ROMs

I purchased an Android Phone in the early days of it coming to Indian market and I able to afford it ( just got a job), during mid 2010. The recent and popular version was Éclairs. I just went up to the shop and bought a nice and shiny Motorola Quench XT3 (XT502). When in other places XT502 was having Andriod Donut, I got as a special offer Éclairs. I was happy as hell. The Droid A couple of year passes, and newer versions of Android came from Éclairs to Froyo to Gingerbread then to the bigger version upgrades like Honeycomb, Ice Cream Sandwich and then Jelly Bean. By the end of 2012, I was literally surviving on my Motorola with Éclairs. I had to upgrade, anyhow. Now more that Motorola denied any upgrades for Quench XT3 . A trivia, Android version names are taken from sugary desserts with lexicographic sequencing. So, I did up-gradation from Éclairs to Gingerbread using a custom ROM from Cyanogenmod . An excellent community of enthusiast who develop their own custom ROMs oft...

RabbitMQ and SSL

RabbitMQ is an AMQP provider i.e. it can reliably queue, service and maintain messages according to a range of policies and parameters. By default, it listens to plain old TCP connections and sends and receives messages over plaintext. This feature just works "out of the box". For users who wish to use SSL over TCP aka TLS, it requires a bit more work on their part. First, let's create a bunch of certificates and sign them with our own CA. For this, we'll use easyrsa3 . Easyrsa is a CLI tool to create, sign and manage your own certification authorities. It's maintained by OpenVPN team. Download easyrsa using your native package manager i.e. yum or apt-get $cp -Rp /usr/share/easy-rsa ~/easy-rsa-3   $cd ~/easy-rsa-3 $./easyrsa init-pki $./easyrsa build-ca $./easyrsa build-server-full broker [nopass] $./easyrsa build-client-full client1 [nopass] This creates three entities (collection of private keys, public keys and certificates) for a CA, a s...