What Is Apache Spark? Key Features and Common Use Cases

apache-spark

We live in a data-driven world, where a large processing model is needed to handle massive amounts of data. Apache Spark is a helpful open-source platform and a big deal for processing massive volumes of data. However, you must first learn Apache Spark to work with large amounts of data. This blog will go over […]

A look into ETA Problem using Regression in Python – Machine Learning

ETA Problem using Regression in Python

The term “ETA” usually means “Estimated Time of Arrival” but in the technology realm it refers as “Estimated Completion Time” of a computational process in general. In particular, this problem is specific to estimating completion time of a batch of long scripts running parallel to each other. Problem A number of campaigns run together in […]