Language Sample Analysis in Bilingual Children: A Meta-Analysis of Diagnostic Accuracy.

TL;DR

Language sample analysis (LSA) shows moderate diagnostic accuracy for identifying language disorders in bilingual children (70% sensitivity, 83% specificity), but performs best when combined with other assessment methods rather than used in isolation.

What the Study Found

Identifying developmental language disorder (DLD) in bilingual children presents a genuine clinical challenge. Standard language tests designed for monolingual speakers often misclassify bilingual children as language-disordered when they are simply navigating two language systems, while other children with genuine disorders can slip through assessment gaps. Language sample analysis, which involves analyzing spontaneous speech or elicited language samples to measure metrics like utterance length, grammatical complexity, and vocabulary diversity, is often promoted as a more culturally fair alternative to standardized testing.

This meta-analysis examined nine studies evaluating LSA's diagnostic accuracy in bilingual children aged 2-7 years. The findings reveal a middle ground: LSA is useful, but inconsistently so. The pooled sensitivity (ability to correctly identify children who have DLD) was 70%, meaning it would miss about 3 in 10 children with genuine language disorders. Specificity (ability to correctly identify typically developing children) was more robust at 83%, meaning it would incorrectly flag roughly 1 in 5 typically developing children as disordered. This sensitivity-specificity gap matters clinically: false negatives (missed cases) can delay intervention for children who need it, while false positives can lead to unnecessary services.

The variability across studies was substantial. Some individual LSA metrics showed poor diagnostic accuracy, while others performed in the fair to good range. Critically, the language of elicitation made no significant difference in accuracy, meaning whether assessments were conducted in the child's first or second language didn't systematically improve or worsen diagnostic identification. However, integrated measures that combined multiple LSA metrics with other assessment approaches (such as parent report, additional language samples, or standardized tests adapted for bilinguals) showed the highest diagnostic accuracy overall. These combinations approached both higher sensitivity and specificity than LSA alone.

The meta-analysis underscores an important distinction: which specific LSA metrics were used mattered substantially. Not all measures of language production carry equal diagnostic weight. Combinations of multiple indices, such as pairing mean length of utterance with vocabulary measures or adding morphosyntactic analysis, performed better than any single metric in isolation. This suggests that language production is multidimensional, and a one-metric approach to assessment captures an incomplete picture.

What This Means for You

If you are a clinician or educator working with bilingual children and language development concerns exist:

LSA is a legitimate tool, but design your assessment accordingly. The 70% sensitivity means LSA should not be your sole decision point. Use it as one component within a battery that includes other information: parent observations across both languages, structured tasks beyond spontaneous speech, and ideally, input from someone with bilingual language expertise.

Choose multiple metrics over single measures. If you are conducting language sample analysis, don't rely on mean length of utterance alone or vocabulary count alone. The evidence supports using combinations of metrics (grammatical complexity plus vocabulary diversity plus utterance structure, for example) for more reliable identification.

Language of assessment didn't predict accuracy. You don't need to conduct the assessment exclusively in the dominant language or the home language to improve identification. However, this doesn't mean language choice is irrelevant to the child's experience; rather, the choice of language for the sample itself doesn't mechanically improve or worsen diagnostic accuracy. Quality of the sample and skill in analysis appear to matter more.

Recognize the miss rate. A 70% sensitivity leaves meaningful room for false negatives. If a child has risk factors for language disorder or if clinical judgment raises concerns despite LSA results, pursue additional assessment. Sensitivity of this level should not produce definitive conclusions on its own.

Study Details

Factor	Details
Study Type	Systematic review and meta-analysis
Number of Studies Included	9
Participant Age Range	2-7 years
Condition	Developmental language disorder (DLD) in bilingual children
Primary Outcome	Diagnostic accuracy (sensitivity and specificity) of LSA
Pooled Sensitivity	70%
Pooled Specificity	83%
Key Finding	Integrated measures (LSA plus other methods) showed highest accuracy
Journal	American Journal of Speech-Language Pathology

Sources

American Journal of Speech-Language Pathology. Language Sample Analysis in Bilingual Children: A Diagnostic Accuracy Meta-Analysis. https://pubmed.ncbi.nlm.nih.gov/41842702/

ProtocolEngine provides general health information based on published research. This is not medical advice. Consult a healthcare professional before starting any supplement or health protocol.