Études fondées sur les communautés Reddit

Machine learning approaches to identifying neurodevelopmental disorders using social media data: a systematic review.

Int J Med Inform . 2026;213 :106378

Résumé

BACKGROUND: Neurodevelopmental disorders (NDDs), such as autism spectrum disorder (ASD) and attention-deficit/hyperactivity disorder (ADHD), often emerge in early childhood and can significantly impact social, academic, and emotional functioning. Early identification is critical to improving long-term outcomes, yet traditional diagnostic processes are time-consuming and resource-intensive. As social media becomes an integral part of daily life, user-generated content offers a novel source of behavioral and linguistic data that may support early detection. Advances in artificial intelligence (AI) and machine learning (ML) now make it possible to analyze these large-scale data streams for clinical insights.METHODS: A comprehensive search was performed across five databases-PubMed, Scopus, Web of Science, IEEE Xplore, and ACM Digital Library-from inception up to July 2025. Studies were included if they used AI or ML methods to analyze social media data for detecting NDDs. Data extraction focused on platform type, targeted disorder, dataset size, ML technique, and diagnostic performance.RESULTS: Nineteen studies met the inclusion criteria. Most focused on ASD and ADHD, using platforms such as Reddit, Twitter, YouTube, and Facebook. ML models achieved moderate to high classification performance, with F1-scores ranging from approximately 0.48 to 0.89 depending on the data type and disorder. Video-based models showed particular promise in identifying nonverbal behavioral markers. However, research on other NDDs remains limited, and methodological heterogeneity, small sample sizes, and ethical challenges persist.CONCLUSION: AI-driven analysis of social media data holds significant promise for scalable, non-invasive screening of neurodevelopmental disorders. While current work largely focuses on ASD and ADHD, future research should extend to underrepresented NDDs and address concerns related to data validity, bias, and privacy. With continued advancement, these tools may serve as valuable complements to traditional diagnostic methods.

Tous les articles