A scalable platform for big data analysis in public transport


Uçak E., Karagümüş E., ŞENER C.

Concurrency and Computation: Practice and Experience, 2021 (Journal Indexed in SCI)

  • Publication Type: Article / Abstract
  • Volume:
  • Publication Date: 2021
  • DOI Number: 10.1002/cpe.6534
  • Title of Journal: Concurrency and Computation: Practice and Experience
  • Keywords: Apache Beam, big data analysis, Google Cloud Dataflow, public transport, scalability, DATA ANALYTICS

Abstract

© 2021 John Wiley & Sons Ltd. Any life event or action can be seen as a potential source of data to analyze, and analyzing such data yields insight into what is actually happening. The situation is no different in public transport. Researchers in the fields of transport and traffic have stated that such analysis would be invaluable in designing urban transport and, in particular, in adapting to current changes. In this study, a scalable public transport analysis platform named Cermoni is developed using the Apache Beam programming model. It can analyze, in near real time, the collected smart card and vehicle location data, which qualify as big data because of the high rate at which they are produced. The performance of the platform was tested on the Google Cloud Dataflow service using real-world data gathered from Konya, one of the largest metropolitan cities in Turkey, and the results are discussed in detail.