Detecting Parkinson's Disease from Interactions with a Search Engine: Is Expert Knowledge Sufficient?

Parkinson's disease (PD) is a slowly progressing neurodegenerative disease with early manifestation of motor signs. Recently, there has been a growing interest in developing automatic tools that can assess motor function in PD patients. Here we show that mouse tracking data collected during people's interaction with a search engine can be used to distinguish PD patients from similar, non-diseased users and present a methodology developed for the diagnosis of PD from these data. The main challenge we address is the extraction of informative features from raw mouse tracking data. We do so in two complementary ways: First, we manually construct expert-recommended features, aiming to identify abnormalities in motor behaviors. Second, we use an unsupervised representation learning technique to map these raw data to high-level features. Using all the extracted features, a Random Forest classifier is then used to distinguish PD patients from controls, achieving an AUC of 0.92, while results using only expert-generated or auto-generated features are 0.87 and 0.83, respectively. Our results indicate that mouse tracking data can help in detecting users at early stages of the disease and that both expert-generated features and unsupervised techniques for feature generation are required to achieve the best possible performance.

[1]  Ian Butterworth,et al.  Computer keyboard interaction as an indicator of early Parkinson’s disease , 2016, Scientific Reports.

[2]  S. Gilman,et al.  Diagnostic criteria for Parkinson disease. , 1999, Archives of neurology.

[3]  Ilya Sutskever,et al.  Learning to Generate Reviews and Discovering Sentiment , 2017, ArXiv.

[4]  M. Breteler,et al.  Epidemiology of Parkinson's disease , 2006, The Lancet Neurology.

[5]  Daqing He,et al.  Searching, browsing, and clicking in a search session: changes in user behavior by task and over time , 2014, SIGIR.

[6]  Fernando Diaz,et al.  Search Result Prefetching Using Cursor Movement , 2016, SIGIR.

[7]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[8]  Ryen W. White,et al.  User see, user point: gaze and cursor alignment in web search , 2012, CHI.

[9]  Elad Yom-Tov Clinically verified pre-screening for cancer using web search queries: Initial results , 2018, ArXiv.

[10]  Mounia Lalmas,et al.  On saliency, affect and focused attention , 2012, CHI.

[11]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[12]  Ryen W. White,et al.  Evaluation of the Feasibility of Screening Patients for Early Signs of Lung Carcinoma in Web Search Logs , 2017, JAMA oncology.

[13]  Eugene Agichtein,et al.  Inferring Searcher Attention by Jointly Modeling User Interactions and Content Salience , 2015, SIGIR.

[14]  Diana Borsa,et al.  Automatic Identification of Web-Based Risk Markers for Health Events , 2015, Journal of medical Internet research.

[15]  Joemon M. Jose,et al.  User engagement in online News: Under the scope of sentiment, interest, affect, and gaze , 2014, J. Assoc. Inf. Sci. Technol..

[16]  Ryen W. White,et al.  Detecting neurodegenerative disorders from web search signals , 2018, npj Digital Medicine.

[17]  Ryen W. White,et al.  No clicks, no problem: using cursor movements to understand and improve search , 2011, CHI.

[18]  Luis A. Leiva,et al.  Predicting User Engagement with Direct Displays Using Mouse Cursor Information , 2016, SIGIR.