ResearchSpace

Geospatial data stream processing in Python using FOSS4G components

Show simple item record

dc.contributor.author McFerren, Graeme
dc.contributor.author Van Zyl, T
dc.date.accessioned 2017-06-07T07:10:25Z
dc.date.available 2017-06-07T07:10:25Z
dc.date.issued 2016-07
dc.identifier.citation McFerren, G. and Van Zyl, T. 2016. Geospatial data stream processing in Python using FOSS4G components. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XLI-B7, 2016 XIII ISPRS Congress, 12–19 July 2016, Prague, Czech Republic, p. 931-937. doi:10.5194/isprs-archives-XLI-B7-931-2016 en_US
dc.identifier.uri doi:10.5194/isprs-archives-XLI-B7-931-2016
dc.identifier.uri http://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLI-B7/931/2016/
dc.identifier.uri http://hdl.handle.net/10204/9173
dc.description The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XLI-B7, 2016 XIII ISPRS Congress, 12–19 July 2016, Prague, Czech Republic en_US
dc.description.abstract One viewpoint of current and future IT systems holds that there is an increase in the scale and velocity at which data are acquired and analysed from heterogeneous, dynamic sources. In the earth observation and geoinformatics domains, this process is driven by the increase in number and types of devices that report location and the proliferation of assorted sensors, from satellite constellations to oceanic buoy arrays. Much of these data will be encountered as self-contained messages on data streams - continuous, infinite flows of data. Spatial analytics over data streams concerns the search for spatial and spatio-temporal relationships within and amongst data “on the move”. In spatial databases, queries can assess a store of data to unpack spatial relationships; this is not the case on streams, where spatial relationships need to be established with the incomplete data available. Methods for spatially-based indexing, filtering, joining and transforming of streaming data need to be established and implemented in software components. This article describes the usage patterns and performance metrics of a number of well known FOSS4G Python software libraries within the data stream processing paradigm. In particular, we consider the RTree library for spatial indexing, the Shapely library for geometric processing and transformation and the PyProj library for projection and geodesic calculations over streams of geospatial data. We introduce a message oriented Python-based geospatial data streaming framework called Swordfish, which provides data stream processing primitives, functions, transports and a common data model for describing messages, based on the Open Geospatial Consortium Observations and Measurements (O&M) and Unidata Common Data Model (CDM) standards. We illustrate how the geospatial software components are integrated with the Swordfish framework. Furthermore, we describe the tight temporal constraints under which geospatial functionality can be invoked when processing high velocity, potentially infinite geospatial data streams. The article discusses the performance of these libraries under simulated streaming loads (size, complexity and volume of messages) and how they can be deployed and utilised with Swordfish under real load scenarios, illustrated by a set of Vessel Automatic Identification System (AIS) use cases. We conclude that the described software libraries are able to perform adequately under geospatial data stream processing scenarios - many real application use cases will be handled sufficiently by the software. en_US
dc.language.iso en en_US
dc.publisher ISPRS en_US
dc.relation.ispartofseries Worklist;17947
dc.subject Geospatial data streaming platform en_US
dc.subject Data velocity en_US
dc.subject Python en_US
dc.subject FOSS4G geospatial libraries en_US
dc.subject SpS10-FOSS4G en_US
dc.title Geospatial data stream processing in Python using FOSS4G components en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation McFerren, G., & Van Zyl, T. (2016). Geospatial data stream processing in Python using FOSS4G components. ISPRS. http://hdl.handle.net/10204/9173 en_ZA
dc.identifier.chicagocitation McFerren, Graeme, and T Van Zyl. "Geospatial data stream processing in Python using FOSS4G components." (2016): http://hdl.handle.net/10204/9173 en_ZA
dc.identifier.vancouvercitation McFerren G, Van Zyl T, Geospatial data stream processing in Python using FOSS4G components; ISPRS; 2016. http://hdl.handle.net/10204/9173 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - McFerren, Graeme AU - Van Zyl, T AB - One viewpoint of current and future IT systems holds that there is an increase in the scale and velocity at which data are acquired and analysed from heterogeneous, dynamic sources. In the earth observation and geoinformatics domains, this process is driven by the increase in number and types of devices that report location and the proliferation of assorted sensors, from satellite constellations to oceanic buoy arrays. Much of these data will be encountered as self-contained messages on data streams - continuous, infinite flows of data. Spatial analytics over data streams concerns the search for spatial and spatio-temporal relationships within and amongst data “on the move”. In spatial databases, queries can assess a store of data to unpack spatial relationships; this is not the case on streams, where spatial relationships need to be established with the incomplete data available. Methods for spatially-based indexing, filtering, joining and transforming of streaming data need to be established and implemented in software components. This article describes the usage patterns and performance metrics of a number of well known FOSS4G Python software libraries within the data stream processing paradigm. In particular, we consider the RTree library for spatial indexing, the Shapely library for geometric processing and transformation and the PyProj library for projection and geodesic calculations over streams of geospatial data. We introduce a message oriented Python-based geospatial data streaming framework called Swordfish, which provides data stream processing primitives, functions, transports and a common data model for describing messages, based on the Open Geospatial Consortium Observations and Measurements (O&M) and Unidata Common Data Model (CDM) standards. We illustrate how the geospatial software components are integrated with the Swordfish framework. Furthermore, we describe the tight temporal constraints under which geospatial functionality can be invoked when processing high velocity, potentially infinite geospatial data streams. The article discusses the performance of these libraries under simulated streaming loads (size, complexity and volume of messages) and how they can be deployed and utilised with Swordfish under real load scenarios, illustrated by a set of Vessel Automatic Identification System (AIS) use cases. We conclude that the described software libraries are able to perform adequately under geospatial data stream processing scenarios - many real application use cases will be handled sufficiently by the software. DA - 2016-07 DB - ResearchSpace DP - CSIR KW - Geospatial data streaming platform KW - Data velocity KW - Python KW - FOSS4G geospatial libraries KW - SpS10-FOSS4G LK - https://researchspace.csir.co.za PY - 2016 T1 - Geospatial data stream processing in Python using FOSS4G components TI - Geospatial data stream processing in Python using FOSS4G components UR - http://hdl.handle.net/10204/9173 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record