Georg Heiler
apache-spark
AI-based root cause analysis of CPD interference sources in DOCSIS networks
Good quality network connectivity is ever more important. For hybrid fiber coaxial (HFC) networks, searching for upstream high noise was cumbersome and time-consuming in the past. Even with machine learning, the task remains challenging due to the heterogeneity of the network and its topological structure. We present the automation of a simple business rule (largest change of a specific value), compare its performance with state-of-the-art machine-learning methods, and conclude that precision@1 can be improved by a factor of 2.3. Because it is best when a fault does not occur in the first place, we also evaluate multiple approaches to forecasting network faults, which would allow predictive maintenance on the network.
May 10, 2022 12:00 AM — May 12, 2022 12:00 AM
Georg Heiler
Comparing SQL-based streaming approaches
Comparing established and up-and-coming streaming approaches for an integrated real-time data model
Georg Heiler
Last updated on Apr 25, 2022
26 min read
Identifying the root cause of cable network problems with machine learning
Good quality network connectivity is ever more important. For hybrid fiber coaxial (HFC) networks, searching for upstream high noise in …
Georg Heiler, Thassilo Gadermaier, Thomas Haider, Allan Hanbury, Peter Filzmoser
Scalable data pipelines from dagster with pyspark
Getting started with simple dagster pipelines.
Georg Heiler, Sandy Ryza
Last updated on Mar 17, 2022
5 min read
Scalable sparse matrix multiplication
Using Apache Spark for sparse matrix multiplication
Georg Heiler
Last updated on Aug 9, 2021
6 min read
Exact percentiles in Spark
Combining the power of Scala and Python to make the calculation of percentiles in Spark easy and fast
Georg Heiler
Last updated on Nov 22, 2020
7 min read
Arrow 2.0.0 - structs in pandas
Finally, nested types in Arrow.
Georg Heiler
Last updated on Nov 20, 2020
1 min read
Sparkling SCD2
Data preparation using spark without ACID tables
Georg Heiler
Last updated on Nov 19, 2020
4 min read
Run the latest version of spark
Execute the latest version of spark on HDP.
Georg Heiler
Last updated on Oct 29, 2020
4 min read
Production grade pyspark jobs
Use additional python packages with pyspark
Georg Heiler
Last updated on Nov 19, 2020
2 min read