2021-09-30, 15:30–16:00, Ushuaia
Many of the Apache projects serving the big data space do not come with out-of-the-box support for geospatial data types such as points, lines, and polygons. LocationTech GeoMesa provides add-on support for Apache database projects such as Accumulo, Cassandra, and HBase, as well as Redis, by crafting spatial and spatio-temporal keys. In addition to distributed databases, GeoMesa enables spatial storage in many of the popular Apache file format projects, such as Arrow, Avro, ORC, and Parquet. This talk will review the basics of big geo data persistence, whether in a data lake or in a database, and provide an overview of the benefits (and limitations) of each technology.
Jim Hughes, CCRi
Track – Software
Topic – Data collection, data sharing, data science, open data, big data, data exploitation platforms
Level – 2 - Basic. General basic knowledge is required.
Language of the Presentation – English
Jim Hughes is a core committer for GeoMesa, which leverages HBase, Accumulo and other distributed systems to provide distributed computation and query capabilities. He is also a committer for LocationTech JTS and SFCurve.
Emilio Lahr-Vivaz is a software engineer focusing on geospatial big-data solutions, and a committer on LocationTech GeoMesa.