Processing large raster and vector data in Apache Spark

Hagedorn, Stefan; Räth, Timo; Birli, Oliver; Sattler, Kai-Uwe

Spatial data processing frameworks are often limited to vector data. However, raster data is an important type of spatial data: it is produced by sensors on satellites, but also by high-resolution cameras taking pictures of nanostructures, such as chips on wafers. Raster data sets often become large and need to be processed in parallel in a cluster environment. In this paper, we demonstrate our STARK framework with its support for raster data and its functionality for combining raster and vector data in filter and join operations. To save engineers the burden of learning a programming language, queries can be formulated in SQL in a web interface. In the demonstration, users can use this web interface to inspect examples of raster data using our extended SQL queries on an Apache Spark cluster.

Cite

Hagedorn, S., Räth, T., Birli, O., Sattler, K.-U., 2019. Processing large raster and vector data in Apache Spark. In: Datenbanksysteme für Business, Technologie und Web (BTW 2019): 18. Fachtagung des GI-Fachbereichs „Datenbanken und Informationssysteme“ (DBIS), 4.-8. März 2019 in Rostock, Deutschland. 289, pp. 551–554. https://doi.org/10.18420/btw2019-43