Streaming Predictive Analytics on Apache Flink - Foteini Beligianni

English
2016-10-30
€41.34 €51.68

In stock at our supplier

Shipping in 12-18 days

30-day return policy

Description

Data analysis and predictive analytics today are driven by large-scale distributed deployments of complex pipelines that guide data cleaning, model training and evaluation. In this work, we focus on the problem of modelling such a pipeline framework and providing algorithms that build on basic abstractions fundamental to stream processing. We design a streaming machine learning pipeline as a series of stages such as model building, concept drift detection and continuous evaluation. We build our prototype on Apache Flink, a distributed data processing system with streaming capabilities. As a proof of concept, the prototype includes a state-of-the-art implementation of a variation of the Vertical Hoeffding Tree (VHT), a distributed decision tree classification algorithm. Furthermore, we compare our version of VHT with the current state-of-the-art implementations on distributed data processing systems. Our experimental results on real-world data sets show significant performance benefits of our pipeline while maintaining low classification error. We believe that this pipeline framework can offer a good baseline for a full-fledged implementation of streaming algorithms that can work in parallel.

More Information

Author: Foteini Beligianni
Publisher: LAP Lambert Academic Publishing
Release year: 2016
Cover type: Softcover
EAN: 9783659960833