Georg Heiler
Georg Heiler
Home
Blog
Publications
Projects
Lecturing
Talks
Contact
Light
Dark
Automatic
Posts
Modern data orchestration using Dagster
Overview over the modern data stack ecosystem. Introduction to this blog series
Georg Heiler
,
Sandy Ryza
Last updated on Apr 22, 2022
5 min read
Interactive dagster debugging
Interacting with a running dagster instance interactively
Georg Heiler
Last updated on Feb 11, 2022
3 min read
Scalable sparse matrix multiplication
Using Apache Spark for
sparse
matrix multiplication
Georg Heiler
Last updated on Aug 9, 2021
6 min read
COVID population model
WWTF COVID project summary
Georg Heiler
Last updated on May 13, 2021
1 min read
ML project configuration management
Easy configuration handling for complex machine learning pipelines
Georg Heiler
Last updated on May 9, 2021
1 min read
Can you tell the nuts & berries apart in each group?
Guaranteed anonymity in high-dimensional data using differential privacy
Georg Heiler
Last updated on Aug 6, 2021
14 min read
Intersting links about deep learning
Useful links
Georg Heiler
Last updated on May 9, 2021
1 min read
Exact percentiles in Spark
Combining the power of Scala and Python to make the calculation of percentiles in Spark easy and fast
Georg Heiler
Last updated on Nov 22, 2020
7 min read
Arrow 2.0.0 - structs in pandas
Finally, nested types in Arrow.
Georg Heiler
Last updated on Nov 20, 2020
1 min read
Sparkling SCD2
Data preparation using spark without ACID tables
Georg Heiler
Last updated on Nov 19, 2020
4 min read
«
»
Cite
×