An Interface for User-Centred Process and Correlation Between Large Datasets

Standard database query systems are designed to process data on a single installation only, and do not provide optimal solutions for cases that data from multiple sources need to be queried. In these cases, the sources may have different data schemata, data representations etc., necessitating extensive coding and data transformations to retrieve partial results and combine them to reach the desired outcome. Differences in schemata and representations may be subtle and remain unnoticed, leading to the production of erroneous results. The goal of this paper is to present an easy-to-use solution for the end users, enabling them to query data from a given set of databases through a single user interface. This user interface allows users to visualize database contents and query results, while facilities for uploading and validating the data are also accommodated. To demonstrate the applicability of our approach, a use case is presented where data from two different sources are uploaded into the system and thereafter the data from the two databases can be utilized in tandem. The usability evaluation involved software developers in free evaluation scenarios.

[1]  Gusti Ayu Putri Saptawati,et al.  Database integration based on combination schema matching approach (case study: Multi-database of district health information system) , 2017, 2017 2nd International conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE).

[2]  Panagiotis Georgiadis,et al.  An integrated framework for adapting WS-BPEL scenario execution using QoS and collaborative filtering techniques , 2015, Sci. Comput. Program..

[3]  Yiyi Sun Third-Party Library Integration , 2019 .

[4]  Costas Vassilakis,et al.  An Adaptive Social Network-Aware Collaborative Filtering Algorithm for Improved Rating Prediction Accuracy , 2020, IEEE Access.

[5]  Neha Yadav,et al.  LARAVEL: A PHP Framework for E-Commerce Website , 2019, 2019 Fifth International Conference on Image Information Processing (ICIIP).

[6]  Ganesh S. Oak Information Visualization Introduction , 2022 .

[7]  Costas Vassilakis,et al.  "Talking" Triples to Museum Chatbots , 2020, HCI.

[8]  Ying Jiang,et al.  Multi-source heterogeneous Hakka culture heritage data management based on MongoDB , 2016, 2016 Fifth International Conference on Agro-Geoinformatics (Agro-Geoinformatics).

[9]  Venkat Dasari,et al.  Dynamic visualization of large scale tactical network simulations , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[10]  Georgios Kouroupetroglou,et al.  Modeling Improved Prosody Generation from High-Level Linguistically Annotated Corpora , 2005, IEICE Trans. Inf. Syst..

[11]  Tatsuo Tsuji,et al.  Incremental Data Migration for Multi-database Systems Based on MySQL with Spider Storage Engine , 2014, 2014 IIAI 3rd International Conference on Advanced Applied Informatics.

[12]  Diana Maynard,et al.  Analysing and Enriching Focused Semantic Web Archives for Parliament Applications , 2014, Future Internet.

[13]  Dionisis Margaris,et al.  Improving collaborative filtering's rating prediction quality in dense datasets, by pruning old ratings , 2017, 2017 IEEE Symposium on Computers and Communications (ISCC).

[14]  C. Vassilakis,et al.  An integrated framework for QoS-based adaptation and exception resolution in WS-BPEL scenarios , 2013, SAC '13.

[15]  Xuyun Zhang,et al.  Diversified service recommendation with high accuracy and efficiency , 2020, Knowl. Based Syst..

[16]  Suphamit Chittayasothorn,et al.  A Multi-database Access System with Instance Matching , 2017, ACIIDS.

[17]  Yan Yan,et al.  Neural graph personalized ranking for Top-N Recommendation , 2020, Knowl. Based Syst..

[18]  Georgios Kouroupetroglou,et al.  Usability Methodologies for Real-Life Voice User Interfaces , 2009, Int. J. Inf. Technol. Web Eng..

[19]  Sotiris Ioannidis,et al.  A fine-grained social network recommender system , 2019, Social Network Analysis and Mining.

[20]  Bin Liu,et al.  Effective public service delivery supported by time-decayed Bayesian personalized ranking , 2020, Knowl. Based Syst..

[21]  Bin Wu,et al.  BSPR: Basket-sensitive personalized ranking for product recommendation , 2020, Inf. Sci..

[22]  Costas Vassilakis,et al.  A User Interface for Personalized Web Service Selection in Business Processes , 2020, HCI.

[23]  Dionisis Margaris,et al.  Handling uncertainty in social media textual information for improving venue recommendation formulation quality in social networks , 2019, Social Network Analysis and Mining.

[24]  Costas Vassilakis,et al.  Improving collaborative filtering’s rating prediction accuracy by introducing the experiencing period criterion , 2020, Neural Computing and Applications.

[25]  Mohammed Thakir Mahmood,et al.  Web Application Based on MVC Laravel Architecture for Online Shops , 2020 .

[26]  Raul M. S. Laureano,et al.  Business intelligence: Performance and sustainability measures in an ETL process , 2018, 2018 13th Iberian Conference on Information Systems and Technologies (CISTI).

[27]  Georgios Kouroupetroglou,et al.  Modeling Prosodic Structures in Linguistically Enriched Environments , 2004, TSD.

[28]  Xiaoying Wang,et al.  Research on Distributed Storage and Query Optimization of Multi-source Heterogeneous Meteorological Data , 2018, CCIOT 2018.

[29]  Costas Vassilakis,et al.  Semantics-Driven Conversational Interfaces for Museum Chatbots , 2020, HCI.

[30]  Bin Liu,et al.  A Versatile Event-Driven Data Model in HBase Database for Multi-source Data of Power Grid , 2016, 2016 IEEE International Conference on Smart Cloud (SmartCloud).

[31]  Vijay Rana,et al.  Web Search Personalization Using Semantic Similarity Measure , 2020 .

[32]  Sonali Vyas,et al.  A comparative study of various ETL process and their testing techniques in data warehouse , 2017 .

[33]  Fu Jia,et al.  Towards an integrated conceptual framework of supply chain finance: An information processing perspective , 2019 .

[34]  Jordi Cabot,et al.  NeoEMF: a Multi-database Model Persistence Framework for Very Large Models , 2016, D&P@MoDELS.

[35]  Yannis Stavrakas,et al.  The ARCOMEM Architecture for Social- and Semantic-Driven Web Archiving , 2014, Future Internet.

[36]  Alam Rahmatulloh,et al.  Optimizing Data Storage in Handling Dynamic Input Fields with JSON String Compression , 2020, 2020 8th International Conference on Information and Communication Technology (ICoICT).

[37]  Weidong Liu,et al.  Replica-Aware Partitioning Design in Parallel Database Systems , 2017, Euro-Par.

[38]  Costas Vassilakis,et al.  What makes a review a reliable rating in recommender systems? , 2020, Inf. Process. Manag..

[39]  Prasenjit Choudhury,et al.  Enhancing recommendation accuracy of item-based collaborative filtering using Bhattacharyya coefficient and most similar item , 2020, Applied Intelligence.

[40]  Bin Chen,et al.  Mean-Removed Product Quantization for Approximate Nearest Neighbor Search , 2019, 2019 International Conference on Data Mining Workshops (ICDMW).

[41]  Shefali Naik Accessing Data From Multiple Heterogeneous Distributed Database Systems , 2019 .

[42]  Gurvinder Singh,et al.  A review of different cost-based distributed query optimizers , 2018, Progress in Artificial Intelligence.

[43]  M. Nandhini,et al.  Collective dendrogram clustering with collaborative filtering for Distributed Data Mining on electronic health records , 2017, 2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[44]  Samiran Chattopadhyay,et al.  SysML Based Conceptual ETL Process Modeling , 2017 .

[45]  Costas Vassilakis,et al.  Social Relations versus Near Neighbours: Reliable Recommenders in Limited Information Social Network Collaborative Filtering for Online Advertising , 2019, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[46]  K. Basford,et al.  Application of a dendrogram seriation algorithm to extract pattern from plant breeding data , 2017, Euphytica.

[47]  Ewa Maslowska,et al.  The effects of trait-based personalization in social media advertising , 2021, Comput. Hum. Behav..

[48]  Martín Ugarte,et al.  Foundations of JSON Schema , 2016, WWW.

[49]  Costas Vassilakis,et al.  Policy Making Analysis and Practitioner User Experience , 2020, HCI.

[50]  José Luis Ortega,et al.  Blogs and news sources coverage in altmetrics data providers: a comparative analysis by country, language, and subject , 2019, Scientometrics.

[51]  Michael Sirivianos,et al.  Hihooi: A Database Replication Middleware for Scaling Transactional Databases Consistently , 2020, ArXiv.

[52]  Peng Zhang,et al.  Practice Teaching Reform of Discrete Mathematics Model based on D3.js , 2019, 2019 14th International Conference on Computer Science & Education (ICCSE).

[53]  Costas Vassilakis,et al.  Data-Assisted Persona Construction Using Social Media Data , 2020, Big Data Cogn. Comput..

[54]  Chaomei Chen Information visualization: Information visualization , 2010 .

[55]  Nima Jafari Navimipour,et al.  Deterministic and non‐deterministic query optimization techniques in the cloud computing , 2019, Concurr. Comput. Pract. Exp..

[56]  Enhong Chen,et al.  Personalized Ranking with Importance Sampling , 2020, WWW.

[57]  Zhihong Shen,et al.  SimbaQL: A Query Language for Multi-source Heterogeneous Data , 2018, BigSDM.

[58]  Nezam Mahdavi-Amiri,et al.  Large-Neighborhood Infeasible Predictor–Corrector Algorithm for Horizontal Linear Complementarity Problems over Cartesian Product of Symmetric Cones , 2018, J. Optim. Theory Appl..

[59]  Panagiotis Georgiadis,et al.  Query personalization using social network information and collaborative filtering techniques , 2018, Future Gener. Comput. Syst..

[60]  Dionisis Margaris,et al.  Improving Collaborative Filtering’s Rating Prediction Quality by Exploiting the Item Adoption Eagerness Information , 2019, 2019 IEEE/WIC/ACM International Conference on Web Intelligence (WI).

[61]  Georgios Kouroupetroglou,et al.  Auditory universal accessibility of data tables using naturally derived prosody specification , 2010, Universal Access in the Information Society.

[62]  Lingxu Guo,et al.  A Multi-database Hybrid Storage Method for Big Data of Power Dispatching and Control , 2019, 2019 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[63]  S. Kruikemeier,et al.  Personalization of politics on Facebook: examining the content and effects of professional, emotional and private self-personalization , 2020, Information, Communication & Society.

[64]  Costas Vassilakis,et al.  Improving Collaborative Filtering’s Rating Prediction Accuracy by Introducing the Common Item Rating Past Criterion , 2019, 2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA).