Georg Heiler
apache-spark
Deterministic scale-out for spark jobs under increased load
Make Spark jobs scale reliably using iteration
Georg Heiler
Last updated on Oct 29, 2020
2 min read
Spark and Hive 3
Get Spark and Hive to play nice again on HDP 3.1
Georg Heiler
Last updated on Oct 29, 2020
2 min read
Parallel aggregation of dataframes
Use the idempotency of RDDs to your advantage
Georg Heiler
Last updated on Oct 29, 2020
1 min read
Geospatial binning with hexagons on spark
Bring hexagons as efficient spatial operations to Spark
Georg Heiler
Last updated on Nov 28, 2019
2 min read
data processing
Recent history of data processing.
Aug 4, 2019 9:00 AM — 11:00 AM
Yogyakarta, Indonesia
Georg Heiler
Spark descriptive name for cached dataframes
Display user-friendly names for cached tables in the Spark web UI
Georg Heiler
Last updated on Nov 20, 2019
1 min read
Solve data skew issues for array columns in spark
Prevent data skew issues for array columns.
Georg Heiler
Last updated on Nov 20, 2019
4 min read
Ultimate open vector geoprocessing on spark
Combine the strengths of GeoMesa and GeoSpark for ultimate geoprocessing capabilities on Spark
Georg Heiler
Last updated on Oct 29, 2020
4 min read
Analyze OSM data in spark
Analyze the OSM community and extract geometries from the OSM graph.
Georg Heiler
Last updated on Oct 29, 2020
2 min read
OSM to Spark
Processing OSM in a scalable, Hadoop-native way.
Georg Heiler
Last updated on Oct 29, 2020
6 min read