A Cloud-based Exploration of Open Data: Promoting Transparency and Accountability of the Federal Government of Australia

The Open Data movement has become more popular since governments such as USA, UK, Australia and New Zealand decided to open up much of their public information. Data is open if anyone is free to use, reuse and redistribute it. The main benefits that a government can obtain from Open Data include transparency, participation and collaboration. The aim of this research is to promote transparency and accountability of the Federal Government of Australia by using Cloud-related technologies to transform a set of publicly available data into human-friendly visualizations in order to facilitate its analysis. The datasets include details of politicians, parties, political opinions and government contracts among others. This paper describes the stages involved in transforming an extensive and diverse collection of data to support effective visualization that helps to highlight patterns in the datasets that would otherwise be difficult or impossible to identify.

[1]  Raouf Boutaba,et al.  Cloud computing: state-of-the-art and research challenges , 2010, Journal of Internet Services and Applications.

[2]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[3]  Toby Velte,et al.  Cloud Computing, A Practical Approach , 2009 .

[4]  Jimmy J. Lin,et al.  Book Reviews: Data-Intensive Text Processing with MapReduce by Jimmy Lin and Chris Dyer , 2010, CL.

[5]  Jerry Brito,et al.  Hack, Mash & Peer: Crowdsourcing Government Transparency , 2007 .

[6]  Antony J. Williams,et al.  Beautiful Data: The Stories Behind Elegant Data Solutions , 2009 .

[7]  J. Chris Anderson,et al.  CouchDB: The Definitive Guide , 2010 .

[8]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[9]  Rudolf Bayer The Universal B-Tree for multidimensional Indexing , 1996 .

[10]  Daniel Lathrop,et al.  Open Government: Collaboration, Transparency, and Participation in Practice , 2010 .

[11]  Ali Khajeh-Hosseini,et al.  Research Agenda in Cloud Technologies , 2010, ArXiv.

[12]  Geoffrey C. Fox,et al.  MapReduce for Data Intensive Scientific Analyses , 2008, 2008 IEEE Fourth International Conference on eScience.

[13]  Clinton Gormley,et al.  Elasticsearch: The Definitive Guide , 2015 .

[14]  Tim Davies,et al.  Open data, democracy and public sector reform. A look at open government data use from data.gov.uk , 2010 .

[15]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[16]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[17]  Rudolf Bayer,et al.  The Universal B-Tree for Multidimensional Indexing: general Concepts , 1997, WWCA.

[18]  E. Felten,et al.  Government Data and the Invisible Hand , 2009 .