Confidence Model-Based Data Repair for Telco Localization

Telecommunication (Telco) localization is a technique to localize mobile devices (MDs) outdoor by using measurement report (MR) data. Unfortunately, existing Telco localization approaches (with localization error > 50 meters) cannot achieve comparable localization accuracy as GPS (with localization error ±10 meters). In particular, due to signal interference and attenuation caused by high buildings in urban cities, it is hard to achieve high localization accuracy if MR records contain unstable signal data. To this end, we propose to first detect those MR records incurring high localization errors and next repair the predicted location (with high errors). Our experiments on two real MR data sets from 2G GSM and 4G LTE Telco networks verify that a Telco localization approach, enhanced by the proposed detection and repair algorithms, can greatly improve localization accuracy. For example, the enhancement algorithm on 2G and 4G data sets can achieve 29.5 and 13.2 meters of median errors, around 160.68% and 201.51% better than previous results. Such result indicates that our work can achieve nearly comparable localization accuracy as GPS.

[1]  R.L. Moses,et al.  Locating the nodes: cooperative localization in wireless sensor networks , 2005, IEEE Signal Processing Magazine.

[2]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[3]  Paul Horton,et al.  Network-based de-noising improves prediction from microarray data , 2006, BMC Bioinformatics.

[4]  Bjoern H. Menze,et al.  A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data , 2009, BMC Bioinformatics.

[5]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[6]  Fei Wang,et al.  OceanST: A Distributed Analytic System for Large-Scale Spatiotemporal Mobile Broadband Data , 2014, Proc. VLDB Endow..

[7]  Mohamed Ibrahim,et al.  CellSense: An Accurate Energy-Efficient GSM Positioning System , 2011, IEEE Transactions on Vehicular Technology.

[8]  Chunping Li,et al.  Turn Waste into Wealth: On Simultaneous Clustering and Cleaning over Dirty Data , 2015, KDD.

[9]  Bernhard Schölkopf,et al.  Domain Adaptation with Conditional Transferable Components , 2016, ICML.

[10]  Ting Wang,et al.  Application of Breiman's Random Forest to Modeling Structure-Activity Relationships of Pharmaceutical Molecules , 2004, Multiple Classifier Systems.

[11]  H. Koshima,et al.  Personal locator services emerge , 2000 .

[12]  Luca Calderoni,et al.  Indoor localization in a hospital environment using Random Forest classifiers , 2015, Expert Syst. Appl..

[13]  Rajeev Rastogi,et al.  A cost-based model and effective heuristic for repairing constraints by value modification , 2005, SIGMOD '05.

[14]  Jianmin Wang,et al.  Constraint-Variance Tolerant Data Repairing , 2016, SIGMOD Conference.

[15]  Wenfei Fan,et al.  Dependencies revisited for improving data quality , 2008, PODS.

[16]  Hans-Peter Kriegel,et al.  Minimal spatio-temporal database repairs , 2013, SIGSPATIAL/GIS.

[17]  Roberto Tamassia,et al.  Minimal Spatio-Temporal Database Repairs , 2015, SSTD.

[18]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[19]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[20]  Tao Gu,et al.  City-Scale Localization with Telco Big Data , 2016, CIKM.

[21]  B. Ludden,et al.  GSM standards activity on location , 1999 .

[22]  Moustafa Youssef,et al.  Dejavu: an accurate energy-efficient outdoor localization system , 2013, SIGSPATIAL/GIS.

[23]  Niki Pissinou,et al.  Belief based data cleaning for wireless sensor networks , 2012, Wirel. Commun. Mob. Comput..

[24]  Erik G. Ström,et al.  RSS-based sensor localization with unknown transmit power , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[26]  Michael Gertz,et al.  Advances in Spatial and Temporal Databases , 2017, Lecture Notes in Computer Science.