In June 2008, Prof. Volker Mark laid

In June 2008, Prof. Volker Mark laid the foundations of a new group at Technical university Berlin. The name of group was made as Database systems and Information Management abbreviated as DIMA. This is the same time, when the idea about apache flink was initiated. Professor Volker had a vision to develop a system which can process massive amount of data parallel and in distributed fashion. It was the same time when a similar rival of apache Flink, Apache spark was released in 2009 in USA. The idea of apache flink was to have similar solution to apache spark but to have distinct features which can outperform apache spark. The idea was to provide a system with unified API’s to process large amount of data using relation query language with user defined functions and provide the much better stream processing framework. Prof. Markl’s PhD students Stephan Ewen and Fabian Hüske built the very first prototype and shortly thereafter teamed up with Daniel Warneke, a PhD student in Prof. Odej Kao’s Complex and Distributed IT Systems (CIT) Group at TU Berlin. Soon after, Prof. Markl and Prof. Kao sought to collaborate with additional systems researchers in the greater Berlin area, in order to extend, harden, and validate their initial prototype. In 2009, Prof. Markl and Prof. Kao, jointly with researchers from Humboldt University (HU) of Berlin & the Hasso Plattner Institute (HPI) in Potsdam, co-wrote a DFG (German Research Foundation) research unit proposal entitled “Stratosphere – Information Management on the Cloud R4,” which was funded in 2010. This initial DFG grant (spanning 2010-2012) extended the original vision to develop a novel, database-inspired approach to analyze, aggregate, and query very large collections of either textual or (semi-)structured data on a virtualized, massively parallel cluster architecture. The follow-on DFG proposal entitled, “Stratosphere II: Advanced Analytics for Big Data” was also jointly co-written by researchers at TU Berlin, HU Berlin, and HPI and was funded in 2012. This second DFG grant (spanning 2012-2015) shifted the focus towards the processing of complex data analysis programs with low-latency. These early initiatives coupled with grants from the EU FP7 and Horizon 2020 Programmes, EIT Digital, German Federal Ministries (BMBF and BMWi), and industrial grants from IBM, HP, and Deutsche Telekom, among others provided the financial resources necessary to lay the initial foundation. Certainly, funding plays a critical role, however, success could only be achieved with the support of numerous collaborators, including members at DFKI (The German Research Centre for Artificial Intelligence), SICS (The Swedish Institute of Computer Science), and SZTAKI (The Hungarian Academy of Sciences), among many others who believed in our vision, contributed, and provided support over the years. In addition, the contributions from numerous PhD and Master’s students, and Postdoctoral Researcher Dr. Kostas Tzoumas paved the way for what is today Apache Flink. The project was accepted as an Apache Incubator project on April 16, 2014 and then went on to become an Apache Top-Level Project in December 2014. Kostas Tzoumas and Stephan Ewen established a startup company and gathered many creators of apache Flink. They named the company as Data Artisans. The idea was to make  a company which was devoted to make apache flink as data driven next generation open source project for programming data-intensive applications.The core feature of Apache Flink was to provide true streaming that is a streaming which is not based on mini batches to simulate streaming only. Apache Flink is based on streams and operators. It has attractive features like abstraction, high level set of APIS, feature to combine static and streaming data, support to run SQL queries on the data, Graph processing, machine learning and real time stream processing. It came later than Apache Spark and outperformed many systems including Apache Spark in stream processing.Apache Flink got popularity due to its continuous checkpointing and better fault tolerance in streaming applications. We measure the popularity of apache Flink by stackoverflow tags, flink mailing list, contributors, fundings and customers. data Artisans started with a seed financing round of 1 million euros from b-to-v Partners in summer 2014, and raised a Series A round of 5.5 million euros led by Intel Capital with participation from b-to-v Partners and Tengelmann Ventures in April 2016. Data Artisans got the funding from different investors and its total funding is now about $6 million. Since the company was founded, many team members (flink.apache.org/community.html#people) from data Artisans are active contributors to Apache Flink, So, the people are successful in their efforts to make apache flink popular and now a days, apache flink is a standard in stream processing and leading the industry in its domain. Universities and companies are using apache flink for data processing.  Apache Flink is today one of the most active open source projects in the Apache Software Foundation with users in academia and industry, as well as contributors and communities all around the world.