Talk

Understanding geospatial data with duckdb

LanguageEnglish
Audience levelBeginner
Elevator pitch

Usually, scientists and developers work and visualize geospatial data in many applications. However, lack of understanding generates confusion and adds difficulty on daily tasks. This talk aims to give attendees concepts about what is, and work with geospatial data in a modern way using duckdb

Abstract

Geospatial data can be defined as information regarding objects or events that include location as additional attributes. This means that you can map features that represent real-world phenomena into latitude and longitude coordinates from earth’s surface. With this, you can perform analysis, generate insights and create a clear picture of a given situation. For example, in an emergency context, humanitarian organizations apply geospatial analysis to find health facilities nearby an earthquake epicenter, improving operations while reducing response times.

We can leverage the understanding of different circumstances by applying location information in a given context. However, it becomes necessary to have well-defined concepts and terminology. Working with geospatial data requires a minimum basis that can facilitate the daily work of many developers and analysts.

In this talk, I will provide an introduction of performing taks and working with geospatial data using duckdb. We will take advantage of this modern tool, that provides it users the ease of getting started, working with it and also integrate with different programming languages, such as Python. At the end of this talk, attendees will have a clear view about what is geospatial data, how to collect it and run geospatial operations. I will select common and open data sources used in emergency context. However, the knowledge can be re-used in any real world scenario.

The schedule of this talk is the following:

  • Installation of duckdb (10 seconds).
  • Basics of GIS (10 minutes).
  • Loading geospatial data in duckdb (5 minutes) using custom functions.
  • Operations with vector data (intersection of polygons, aggregation of points, etc.) (10 minutes).
  • Data export using geospatial standards (3 minutes).
  • Common issues found (2 minutes).
TagsDatabases, Applications, Scientific Python
Participant

Jorge Martinez Gomez

Hello! I’m Jorge Martinez, currently working as a geospatial software engineer at the World Food Programme. My focus of work is mostly on geo-data collection and analysis to handle emergencies at a global level My background is in electronic engineering, with a master in computer science. I did research in computer vision and machine learning and then jumped into GIS focusing of development of different open source tools, ranging from spatial database infrastructure, to OpenStreetMap-derived tools. On my free time, I like to mix music, visit historical museums and try exotic food.