Hildebrandt, Kai, Panse, Fabian, Wilcke, Niklas, Ritter, Norbert: Large-Scale Data Pollution with Apache Spark. In: IEEE Transactions on Big Data (2020), http://ieeexplore.ieee.org/document/7809119/