skip to main content
10.1145/3329189.3329204acmotherconferencesArticle/Chapter ViewAbstractPublication PagespervasivehealthConference Proceedingsconference-collections
research-article

Towards Reliable Data Collection and Annotation to Extract Pulmonary Digital Biomarkers Using Mobile Sensors

Authors Info & Claims
Published:20 May 2019Publication History

ABSTRACT

Proliferation of sensors embedded in smartphones and smartwatches helps capture rich dataset for machine learning algorithms to extract meaningful digital bio-markers on consumer devices for monitoring disease progression and treatment response. However, development and validation of machine learning algorithms depend on gathering high fidelity sensor data and reliable ground-truth. We conduct a study, called mLungStudy, with 131 subjects with varying pulmonary conditions to collect mobile sensor data including audio, accelerometer, gyroscope using a smartphone and a smartwatch, in order to extract pulmonary biomarkers such as breathing, coughs, spirometry, and breathlessness. Our study shows that commonly used breathing ground-truth data from chestband may not always be reliable as a gold-standard. Our analysis shows that breathlessness biomarkers such as pause time and pause frequency from 2.15 minutes of audio can be as reliable as those extracted from 5 minutes' worth of speech data. This finding can be useful for future studies to trade-off between the reliability of breathlessness data and patient comfort in generating continuous speech data. Furthermore, we use crowdsourcing techniques to annotate pulmonary sound events for developing signal processing and machine learning algorithms. In this paper, we highlight several practical challenges to collect and annotate physiological data and acoustic symptoms from chronic pulmonary patients and ways to improve data quality. We show that the waveform visualization of the audio signal improves annotation quality which leads to a 6.59% increase in cough classification accuracy and a 6% increase in spirometry event classification accuracy. Findings from this study inform future studies focusing on developing explainable machine learning models to extract pulmonary digital bio-markers using mobile sensors.

References

  1. Mohsin Ahmed, Md Mahbubur Rahman, Viswam Nathan, Ebrahim Nemati, Korosh Vatanparvar, and Jilong Kuang. 2019. mLung: Privacy-Preserving Naturally Windowed Lung Activity Detection for Pulmonary Patients. In IEEE BSN.Google ScholarGoogle Scholar
  2. ALA 2018. American Lung Association. Retrieved December 12, 2018 from http://www.lung.org/lung-health-and-diseases/lung-disease-lookup/copd/learn-about-copd/how-serious-is-copd.htmlGoogle ScholarGoogle Scholar
  3. Rummana Bari, Roy J Adams, Md Mahbubur Rahman, Megan Battles Parsons, Eugene H Buder, and Santosh Kumar. 2018. rConverse: Moment by Moment Conversation Detection Using a Mobile Respiration Sensor. ACM IMWUT (2018). Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Andrew Bates, Martin J Ling, Janek Mann, and DK Arvind. 2010. Respiratory Rate and Flow Waveform Estimation from Tri-axial Accelerometer Data. In IEEE BSN. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Matthias Budde, Andrea Schankin, Julien Hoffmann, Marcel Danz, Till Riedel, and Michael Beigl. 2017. Participatory Sensing or Participatory Nonsense?: Mitigating the Effect of Human Error on Data Quality in Citizen Science. ACM IMWUT (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S Burge and JA Wedzicha. 2003. COPD Exacerbations: Definitions and Classifications. European Respiratory Journal (2003).Google ScholarGoogle Scholar
  7. Pierre-Régis Burgel, Pascale Nesme-Meyer, Pascal Chanez, Denis Caillaud, Philippe Carré, Thierry Perez, Nicolas Roche, et al. 2009. Cough and Sputum Production are Associated with Frequent Exacerbations and Hospitalizations in COPD Subjects. Elsevier Chest (2009).Google ScholarGoogle Scholar
  8. Soujanya Chatterjee, Md Mahbubur Rahman, Ebrahim Nemati, and Jilong Kunag. 2019. WheezeD: Respiration Phase Based Wheeze Detection Using Acoustic Data From Pulmonary Patients Under Attack. In ACM Pervasive Computing Technologies for Healthcare (PervasiveHealth).Google ScholarGoogle Scholar
  9. Emil Chiauzzi, Carlos Rodarte, and Pronabesh DasMahapatra. 2015. Patient-centered activity monitoring in the self-management of chronic health conditions. BMC medicine (2015).Google ScholarGoogle Scholar
  10. Alvaro A Cruz. 2007. Global surveillance, prevention and control of chronic respiratory diseases: a comprehensive approach. World Health Organization.Google ScholarGoogle Scholar
  11. Emre Ertin, Nathan Stohs, Santosh Kumar, Andrew Raij, Mustafa al'Absi, and Siddharth Shah. 2011. AutoSense: Unobtrusively Wearable Sensor Suite for Inferring the Onset, Causality, and Consequences of Stress in the Field. In ACM SenSys. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Florian Eyben, Martin Wöllmer, and Björn Schuller. 2010. Opensmile: the Munich Versatile and Fast Open-source Audio Feature Extractor. In ACM conference on Multimedia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Grant Fairbanks. 1960. Voice and Articulation Drillbook. Harper New York.Google ScholarGoogle Scholar
  14. FigureEight 2019. Figure Eight. Retrieved April 15, 2019 from https://www.figure-eight.com/Google ScholarGoogle Scholar
  15. Kevin E Forkheim, David Scuse, and Hans Pasterkamp. 1995. A comparison of neural network models for wheeze detection. In IEEE WESCANEX Communications, Power, and Computing.Google ScholarGoogle Scholar
  16. Javier Hernandez, Daniel McDuff, and Rosalind W Picard. 2015. Biowatch: estimation of heart and breathing rates from wrist motions. In IEEE Pervasive Computing Technologies for Healthcare (PervasiveHealth). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Javier Hernandez, Daniel J McDuff, and Rosalind W Picard. 2015. Biophone: Physiology monitoring from peripheral smartphone motions. (2015).Google ScholarGoogle Scholar
  18. Eric C Larson, Mayank Goel, Gaetano Boriello, Sonya Heltshe, Margaret Rosenfeld, and Shwetak N Patel. 2012. SpiroSmart: using a microphone to measure lung function on a mobile phone. In ACM UbiComp. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Daniyal Liaqat, Robert Wu, Andrea Gershon, Hisham Alshaer, Frank Rudzicz, and Eyal de Lara. 2018. Challenges with Real-world Smartwatch based Audio Monitoring. In ACM Workshop on Wearable Systems and Applications. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. LungInstitute 2018. Lung Institute. Retrieved December 12, 2018 from https://lunginstitute.com/blog/the-cost-of-lung-disease/Google ScholarGoogle Scholar
  21. Marc Miravitlles. 2011. Cough and Sputum Production as Risk Factors for Poor Outcomes in Patients with COPD. Elsevier Respiratory Medicine (2011).Google ScholarGoogle Scholar
  22. Jimmy Moore, Pascal Goffin, Miriah Meyer, Philip Lundrigan, Neal Patwari, Katherine Sward, and Jason Wiese. 2018. Managing In-home Environments through Sensing, Annotating, and Visualizing Air Quality Data. ACM IMWUT (2018). Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Bernard Munos, Pamela C Baker, Brian M Bot, Michelle Crouthamel, Glen Vries, Ian Ferguson, John D Hixson, Linda A Malek, John J Mastrototaro, Veena Misra, Aydogan Ozcan, Leonard Sacks, and Pei Wang. 2016. Mobile Health: the Power of Wearables, Sensors, and Apps to Transform Clinical Trials. Wiley Online Library Annals of the New York Academy of Sciences (2016).Google ScholarGoogle Scholar
  24. Viswam Nathan, Korosh Vatanparvar, Md Mahbubur Rahman, Ebrahim Nemati, and Jilong Kuang. 2019. Assessment of Chronic Pulmonary Disease Patients Using Biomarkers from Natural Speech Recorded by Mobile Devices. In IEEE BSN.Google ScholarGoogle Scholar
  25. Ebrahim Nemati, Christina Batteate, and Michael Jerrett. 2017. Opportunistic Environmental Sensing with Smartphones: a Critical Review of Current Literature and Applications. Springer Current environmental health reports (2017).Google ScholarGoogle Scholar
  26. Ebrahim Nemati, Md Mahbubur Rahman, Viswam Nathan, and Jilong Kuang. 2018. Private Audio-Based Cough Sensing for In-Home Pulmonary Assessment using Mobile Devices. In EAI International Conference on Body Area Networks.Google ScholarGoogle Scholar
  27. Ebrahim Nemati, Young Soo Suh, Babak Motamed, and Majid Sarrafzadeh. 2016. Gait Velocity Estimation for a Smartwatch Platform using Kalman Filter Peak Recovery. In IEEE BSN.Google ScholarGoogle Scholar
  28. Nuance 2019. Nuance PowerMic III. Retrieved January 15, 2019 from https://www.nuance.com/dragon/dragon-accessories.htmlGoogle ScholarGoogle Scholar
  29. M Pourhomayoun, E Nemati, B Mortazavi, and M Sarrafzadeh. 2015. Context-aware data analytics for activity recognition. In International Conference on Data Analytics.Google ScholarGoogle Scholar
  30. Public Good 2018. Private Data for Public Good. Retrieved December 18, 2018 from http://hdexplore.calit2.net/wp-content/uploads/2015/08/hdx_final_report_small.pdfGoogle ScholarGoogle Scholar
  31. Mahbubur Rahman, Nasir Ali, Rummana Bari, Nazir Saleheen, Mustafa al'Absi, Emre Ertin, Ashley Kennedy, Kenzie L Preston, and Santosh Kumar. 2017. mDebugger: Assessing and Diagnosing the Fidelity and Yield of Mobile Sensor Data. In Springer Mobile Health.Google ScholarGoogle Scholar
  32. Md Mahbubur Rahman, Rummana Bari, Amin Ahsan Ali, Moushumi Sharmin, Andrew Raij, Karen Hovsepian, Syed Monowar Hossain, Emre Ertin, Ashley Kennedy, David H Epstein, Kenzie L Preston, Michelle Jobes, Kenneth D Ward, Mustafa al'Absi, and Santosh Kumar. 2014. Are We There Yet? Feasibility of Continuous Stress Assessment via Wireless Physiological Sensors. In ACM Conference on Bioinformatics, Computational Biology, and Health Informatics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Md Mahbubur Rahman, Ebrahim Nemati, Viswam Nathan, and Jilong Kuang. 2018. InstantRR: Instantaneous Respiratory Rate Estimation on Context-aware Mobile Devices. In EAI International Conference on Body Area Networks.Google ScholarGoogle Scholar
  34. Javier Ramırez, José C Segura, Carmen Benıtez, Angel De La Torre, and Antonio Rubio. 2004. Efficient Voice Activity Detection Algorithms Using Long-term Speech Information. Elsevier Speech communication (2004).Google ScholarGoogle Scholar
  35. Ronald W Schafer. 2011. What is a Savitzky-Golay Filter?{lecture notes}. IEEE Signal processing magazine (2011).Google ScholarGoogle Scholar
  36. sHealth 2018. Samsung Health. Retrieved December 18, 2018 from https://www.samsung.com/us/support/answer/ANS00062448/Google ScholarGoogle Scholar
  37. Germán D Sosa, Angel Cruz-Roa, and Fabio A González. 2015. Automatic Detection of Wheezes by Evaluation of Multiple Acoustic Feature Extraction Methods and C-weighted SVM. In Society for Optics and Photonics Symposium on Medical Information Processing and Analysis.Google ScholarGoogle Scholar
  38. Xiao Sun, Li Qiu, Yibo Wu, Yeming Tang, and Guohong Cao. 2017. Sleepmonitor: Monitoring respiratory rate and body position during sleep using smartwatch. ACM IMWUT (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. R Tehrany. 2015. Speech Breathing Patterns in Health and Chronic Respiratory Disease. Ph.D. Dissertation. University of Southampton.Google ScholarGoogle Scholar
  40. Korosh Vatanparvar, Viswam Nathan, Ebrahim Nemati, Md Mahbubur Rahman, and Jilong Kuang. 2019. A Generative Model for Speech Segmentation and Obfuscation for Remote Health Monitoring. In IEEE BSN.Google ScholarGoogle Scholar
  41. Varun Viswanath, Jake Garrison, and Shwetak Patel. 2018. SpiroConfidence: Determining the Validity of Smartphone Based Spirometry Using Machine Learning. In IEEE EMBC.Google ScholarGoogle Scholar
  42. Rui Wang, Fanglin Chen, Zhenyu Chen, Tianxing Li, Gabriella Harari, Stefanie Tignor, Xia Zhou, Dror Ben-Zeev, and Andrew T Campbell. 2014. StudentLife: Assessing Mental Health, Academic Performance and Behavioral Trends of College Students using Smartphones. In ACMUbiComp. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Shichao Yue, Hao He, Hao Wang, Hariharan Rahul, and Dina Katabi. 2018. Extracting Multi-Person Respiration from Entangled RF Signals. ACM IMWUT (2018). Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Youwei Zeng, Dan Wu, Ruiyang Gao, Tao Gu, and Daqing Zhang. 2018. Full-Breathe: Full Human Respiration Detection Exploiting Complementarity of CSI Phase and Amplitude of WiFi Signals. ACM IMWUT (2018). Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. zephyr 2019. Zephyr Bioharness. Retrieved January 15, 2019 from https://www.zephyranywhere.com/Google ScholarGoogle Scholar

Index Terms

  1. Towards Reliable Data Collection and Annotation to Extract Pulmonary Digital Biomarkers Using Mobile Sensors

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        PervasiveHealth'19: Proceedings of the 13th EAI International Conference on Pervasive Computing Technologies for Healthcare
        May 2019
        475 pages
        ISBN:9781450361262
        DOI:10.1145/3329189

        Copyright © 2019 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 May 2019

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        Overall Acceptance Rate55of116submissions,47%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader