Fact-checked by Grok 2 weeks ago

Bayley Scales of Infant Development

The Bayley Scales of Infant and Toddler Development (BSID) is a standardized, norm-referenced assessment tool designed to evaluate developmental functioning in infants and toddlers aged 16 days to 42 months, identifying potential delays across key domains such as , , motor skills, social-emotional development, and to facilitate early and . Developed by Nancy Bayley, the BSID was first published in 1969 as a comprehensive instrument for assessing , with subsequent revisions expanding its scope and refining its methodology to reflect advances in . The second edition (BSID-II) appeared in 1993, which expanded the age range to 1 to 42 months, followed by the third edition (BSID-III) in 2006, which introduced five core domains using dichotomous scoring (pass/fail). The most recent iteration, the BSID-4 released in 2019 by Pearson Assessments, streamlines administration with polytomous scoring (0-2 points per item), fewer test items for efficiency, and options for caregiver questionnaires and web-based scoring, while maintaining the 30- to 70-minute administration time depending on the child's age. Clinically, the BSID is widely used by professionals such as pediatricians, psychologists, and early intervention specialists to detect developmental disabilities, which affect approximately one in six children in the United States, enabling timely referrals for therapies or support services. It provides scores including scaled subtest scores, composite domain scores, ranks, intervals, equivalents, and values, all derived from a normative sample to ensure reliability and validity in diverse populations. The tool requires trained administrators and includes materials like manipulatives, observation checklists, and parent forms, with adaptations for telepractice to broaden . Overall, the BSID remains a cornerstone in , supporting evidence-based practices to promote optimal developmental outcomes.

Overview

Purpose and Scope

The Bayley Scales of Infant and Toddler Development (Bayley Scales) is a standardized, play-based tool designed to measure key developmental milestones in and by observing their interactions and responses during structured tasks. Its core purpose is to identify developmental delays or strengths early in life, providing standardized scores—such as developmental quotients () in earlier editions—that emphasize age-relative functioning over traditional IQ measures, thereby supporting targeted interventions for at-risk children. This approach prioritizes holistic evaluation to detect subtle variations in growth, enabling clinicians and educators to address potential issues before they widen. The scope of the Bayley Scales extends across multiple developmental domains, including cognitive abilities, language and communication skills, motor development (fine and gross), social-emotional functioning, and , offering a multifaceted profile of a child's progress. Applicable from 16 days to 42 months of age, it is particularly valuable for screening and in populations prone to , such as preterm infants or children with disabilities, where timely can inform personalized support strategies. Since its introduction in 1969 as a foundational tool for general developmental , the scales have evolved into a comprehensive, multi-domain instrument, refined through successive editions to enhance reliability and clinical utility. In clinical practice, the Bayley Scales facilitate , intervention planning, and progress tracking for young children exhibiting developmental concerns. They play a critical role in determining eligibility for early intervention programs, such as those mandated under Part C of the (IDEA) in the United States, which requires evidence of significant delays in specified domains for service access. In research contexts, the scales are employed to investigate developmental trajectories, evaluate the effects of medical conditions like prematurity, and assess intervention outcomes, contributing to broader understandings of early human growth.

Administration and Scoring

The Bayley Scales of Infant and Toddler Development are administered individually in a one-on-one format by trained professionals, such as psychologists, developmental pediatricians, occupational therapists, or speech-language pathologists, who engage the through play-like tasks using and manipulatives to elicit natural responses. The occurs in a quiet, distraction-free , often in a clinical or home setting, with the present but instructed not to assist or prompt the during . Items are presented in order of increasing difficulty, with start points adjusted based on the child's chronological or corrected age to optimize efficiency. Administration typically lasts 30 to 70 minutes, depending on the child's age (from 16 days to 42 months) and the specific edition, though earlier versions like the Bayley-III may extend up to 90 minutes for comprehensive evaluations. Essential materials include a kit comprising an manual, technical manual, stimulus books, record forms, motor-response booklets, observational checklists, and a set of manipulatives such as blocks and toys; questionnaires are incorporated for the social-emotional and scales to gather collateral input. Options for delivery have evolved, with traditional paper-and-pencil methods available across editions, while later versions support administration via Q-global web-based platforms and telepractice guidelines, particularly in the Bayley-4, allowing remote observation with live video for the cognitive, language, and motor scales. Examiners must possess Qualification Level B credentials, generally requiring graduate-level training in , , or related fields, along with specific instruction in Bayley procedures through workshops or online programs to ensure standardized delivery. is a key focus, with studies reporting coefficients of 0.67 to 0.81 across Bayley-4 subscales, underscoring the importance of consistent scoring practices among qualified administrators. Scoring procedures generate raw scores that are converted to scaled scores (mean of 10, standard deviation of 3) for individual subtests and composite scores (mean of 100, standard deviation of 15) for broader domains, alongside ranks, age equivalents, and growth scale values to track developmental progress over time. Earlier editions, such as the Bayley-III, employ dichotomous scoring (0 or 1 point per item based on pass/fail), whereas the Bayley-4 introduces polytomous scoring (0 for absent, 1 for emerging, 2 for mastery) to provide finer-grained of skill levels. Manual scoring is possible, but digital tools like Q-global automate calculations and generate reports. Adaptations account for prematurity by using corrected age up to 24 months, and while core norms derive from U.S. English-speaking samples, cultural and linguistic versions have been developed and validated for diverse populations, such as adaptations for Kenyan children aged 18-36 months to ensure applicability across contexts.

Historical Development

Origins and First Edition

The Bayley Scales of Infant Development originated from the pioneering work of psychologist Nancy Bayley during the 1960s at the . Bayley's development of the scales drew heavily from her earlier longitudinal research, particularly the Berkeley Growth Study initiated in 1928, which tracked the physical, mental, and behavioral growth of 61 infants over decades. This study revealed that early developmental patterns in infancy exhibited no significant demographic biases across socioeconomic, ethnic, or gender lines, with the exception of minor advancements in observed among African American infants compared to their Caucasian counterparts. The first edition of the Bayley Scales of Infant Development (BSID-I), published in 1969 by The Psychological Corporation, was designed to assess developmental milestones in and toddlers aged 2 to 30 months. It consisted of three primary components: the Mental Scale, which included 163 items evaluating cognitive, perceptual, and problem-solving abilities through tasks such as object exploration and simple imitation; the Motor Scale, comprising 111 items that measured fine motor skills (e.g., grasping objects) and (e.g., crawling and standing); and the Infant Behavior Record, a supplementary observational tool with 30 ratings capturing aspects of , attention, and social responsiveness during testing. The scales emphasized play-based interactions to engage young children naturally, with items sequenced by increasing age norms derived from Bayley's prior California First-Year Mental Scale (1933) and California Infant Scale of Motor Development (1936). Standardization occurred on a normative sample of 1,712 typically developing U.S. children, stratified by age, sex, and geographic region, enabling the derivation of age-equivalent scores and developmental quotients. Scores from the BSID-I were reported as the Mental Development Index (MDI) and Psychomotor Development Index (PDI), both standardized with a of 100 and a standard deviation of 16, functioning as developmental quotients to identify deviations from typical progress. While the scales provided a reliable snapshot of early development, they had notable limitations, including a restricted age range that precluded assessment beyond 30 months, the absence of a dedicated or communication domain (with linguistic elements embedded only within the Mental Scale), and rudimentary behavioral observations limited to qualitative examiner notes rather than structured metrics. Early research also highlighted poor , as BSID-I scores at 2 years correlated weakly (r ≈ 0.30–0.50) with later measures around school age, particularly in high-risk populations.

Standardization Across Editions

The standardization of the Bayley Scales of Infant Development has consistently relied on stratified normative samples drawn from the U.S. population, designed to reflect data on factors such as age, sex, race/ethnicity, (via parent education and income), and geographic region. Early editions excluded children from most extreme at-risk groups, such as those with significant disabilities, to establish norms for typically developing infants; this approach shifted in later versions to enhance representation of diverse developmental profiles. Reliability testing across editions has emphasized , test-retest stability, and inter-rater agreement, with coefficients generally indicating strong psychometric properties suitable for clinical and research use. The first edition (1969) was normed on a sample of 1,712 typically developing children aged 2 to 30 months, stratified by age, sex, race, geographic region, and urban-rural residence. Internal consistency for the Mental and Motor scales was high, with split-half reliability coefficients ranging from 0.81 to 0.93. The second edition (1993) expanded the normative sample to over 1,200 children aged 1 to 42 months, maintaining stratification by demographic variables aligned with 1980 U.S. census data and excluding children with known disabilities. Test-retest reliability ranged from 0.80 to 0.85 across scales, while internal consistency for the Mental Scale averaged around 0.88 (median), with coefficients from 0.78 to 0.93 by age group; behavioral observation norms were introduced for the first time. The third edition (2006) utilized a normative sample of 1,700 children aged 1 to 42 months (16 days to 42 months 15 days in some reports), stratified according to 2000 U.S. data and including approximately 10% at-risk children without established risk factors dominating the norms. Separate norms were developed for the caregiver-completed questionnaires on social-emotional and . Internal consistency coefficients ranged from 0.91 to 0.97 for core scales like Cognitive, , and Motor. The fourth edition (2019) featured a normative sample of 1,700 children aged 16 days to 42 months, stratified based on U.S. data with an average household income closely matching the national median ($62,835 vs. $62,175). For the first time, it intentionally included diverse at-risk groups, such as 21 children with (1.2% of the sample), to better represent real-world developmental variability without skewing overall norms. Reliability metrics showed from 0.93 to 0.99 across subtests, test-retest coefficients of 0.81 to 0.87 (over short intervals), and of 0.67 to 0.81. This edition marks a shift toward greater inclusivity in , though cultural adaptations for non-U.S. populations, such as separate norms in other countries, remain supplementary rather than integral to the core U.S.-based framework.

Second Edition (1993)

Key Features and Changes

The second edition of the Bayley Scales of Development (BSID-II), published in 1993, expanded the assessment's applicability by widening the age range from 2 to 30 months in the original edition to 1 to 42 months, allowing for earlier identification and longer-term monitoring of developmental progress. This extension accommodated a broader spectrum of and development, including premature from as early as 1 month corrected age. The BSID-II retained the core structure of the Mental and Motor scales while introducing refinements to enhance and administration efficiency. The Mental Development Index (MDI) scale, which blends cognitive and early abilities such as sensory perception, , problem-solving, and verbal , consists of 178 items of increasing difficulty administered through play-based tasks. The Psychomotor Development Index (PDI) scale focuses on gross and fine motor skills, including coordination and control, with 111 items. A key addition was the Behavior Rating Scale, an observational tool with 30 items assessing qualities like attention, activity level, persistence, and emotional regulation during the testing session, providing a qualitative profile rather than a composite score. These scales incorporated updated items and stimulus materials for improved cultural relevance and ease of use, with approximately 75% of the original Mental Scale items retained alongside new ones to better reflect contemporary developmental norms. Scoring for the MDI and PDI yields standard scores with a mean of 100 and standard deviation of 15, enabling comparison to age-based norms, while the Behavior Rating Scale generates a descriptive without numerical indexing. The assessment maintains a play-based design, engaging children in natural interactions with toys and objects to minimize stress, and typically requires 30 to 60 minutes for administration depending on the child's age and cooperation.

Psychometric Properties

The psychometric properties of the Bayley Scales of Infant Development, Second Edition (BSID-II) demonstrate moderate to high reliability for its core indices, though with some variability across groups and intervals. for the Mental Developmental Index (MDI) and Psychomotor Developmental Index (PDI) ranges from 0.88 to 0.93, as reported in the test manual based on split-half reliability analyses of the standardization sample. Test-retest reliability over 1- to 2-week intervals is 0.77 to 0.83, reflecting stable short-term performance but potential sensitivity to minor developmental fluctuations in infants. The BSID-II was standardized on a normative sample of 1,700 children 1 to 42 months, representative of the U.S. population in terms of , , /, and . Validity evidence for the BSID-II includes content validity established through expert reviews during scale development, ensuring items align with established domains of infant cognition and motor skills. Concurrent validity is supported by moderate correlations with earlier developmental assessments, such as a Pearson r of 0.70 with the Gesell Developmental Schedules in comparative studies of young infants. However, predictive validity for school-age outcomes is limited, particularly in high-risk populations; for example, an MDI score below 70 at 20 months' corrected age predicts (defined as a score below 70 on the ) at 8 years with a positive predictive value of only 37% in extremely low birth weight (ELBW) infants overall, dropping to 20% in those without neurosensory impairments. Key limitations of the BSID-II include a tendency to underestimate developmental abilities in typically developing children compared to later editions, potentially due to conservative normative adjustments. Floor effects are pronounced for severely delayed infants, limiting differentiation at the lower end of the scale and reducing utility for profound impairments. Additionally, the Behavior Rating Scale shows weak predictive power for long-term outcomes, with low correlations to cognitive or adaptive functioning beyond infancy. Specific studies underscore these properties in clinical contexts. A 2005 of 200 ELBW infants ( ≤1,000 g) found a low (r ≈ 0.40) between BSID-II MDI scores at 20 months and full-scale IQ at school age in preterm groups without neurosensory issues, highlighting the scale's challenges in forecasting later amid rapid early . In applications to infants with , research using a modified BSID-II (BSID-M) that excluded motor-influenced items improved cognitive accuracy by addressing overlaps between motor and cognitive domains, where standard items often confound gross motor delays with intellectual functioning.

Third Edition (2006)

Improvements Over Prior Editions

The third edition of the Bayley Scales of Infant and Toddler Development (Bayley-III), published in 2006, introduced a restructured framework that expanded from the three primary scales of the second edition (BSID-II)—Mental, Motor, and Behavior Record—to five distinct domains: Cognitive, Language, Motor, Social-Emotional, and Adaptive Behavior. This separation allowed for more precise evaluation of developmental areas, with the Cognitive scale comprising 91 standalone items assessing sensorimotor development, problem-solving, and early concept formation through direct observation; the Language scale divided into Receptive (49 items) and Expressive (48 items) subscales to isolate communication skills; and the Motor scale split into Fine (66 items) and Gross (72 items) subscales focusing on perceptual-motor and locomotion abilities, respectively. The Social-Emotional domain utilized a 35-item caregiver questionnaire to gauge emotional regulation and social interactions, while the Adaptive Behavior domain employed a 241-item caregiver questionnaire to measure practical daily living skills across conceptual, social, and practical domains. These changes addressed prior limitations in the BSID-II by providing domain-specific composite scores (mean of 100, standard deviation of 15), enabling clinicians to identify targeted delays rather than broad developmental quotients. Administration methods were refined to incorporate direct assessment for the core Cognitive, , and Motor domains—observing the child's interactions with and tasks—while relying on indirect caregiver reports for Social-Emotional and , reducing overall burden and enhancing . Items were updated based on contemporary developmental research, incorporating evidence from longitudinal studies to reflect evolving milestones in , such as refined tasks for and . The age range remained 1 to 42 months, but finer stratification into 17 age bands (10-day intervals up to 5 months, 1-month bands from 6 to 35 months, and 3-month bands from 36 to 42 months) improved norming precision using combined and , allowing for more accurate basal and ceiling determinations. These design shifts emphasized early detection of specific impairments, with time shortened for certain scales (e.g., 30-60 minutes for core domains in younger infants compared to the BSID-II's broader 45-60 minutes per scale). A key improvement was the enhanced differentiation between language and cognitive abilities, previously conflated in the BSID-II's Mental scale, which often masked isolated language delays in at-risk populations like preterm infants. By isolating these domains, the Bayley-III better aligned with diagnostic criteria for conditions such as , supported by validation studies showing moderate to strong correlations (r = 0.60-0.80) between its Language composites and standardized tools like the Preschool Language Scale-4. However, this edition addressed BSID-II criticisms of normative sample biases—such as excluding clinical cases—by including 9.8% of participants with mild developmental concerns, yielding more representative standards. Despite these advances, the Bayley-III introduced score inflation in typically developing children, with average Cognitive and Language composites approximately 10-15 points higher than BSID-II equivalents, potentially underidentifying delays unless adjusted cutoffs (e.g., <85 instead of <70) are applied.

Subscales and Content

The Bayley Scales of Infant and Toddler Development, Third Edition (Bayley-III), features five primary scales that evaluate developmental domains through a combination of direct observation, interaction, and reports, with tasks age-graded from 1 to 42 months to assess progress in typical milestones. Each scale yields scaled scores that contribute to composite indices, providing a structured framework for identifying strengths and delays without relying on complex equations for derivation. The Cognitive Scale comprises 91 items focused on problem-solving, exploration, memory, and attention, administered through interactive play-based tasks that encourage the child's engagement with objects and the examiner. Examples include demonstrating by searching for a partially hidden or imitating simple actions like hands, which gauge the infant's ability to understand cause-and-effect and spatial relationships as development progresses. These items span sensorimotor activities in early months to more abstract concept formation in toddlers, emphasizing conceptual understanding over rote memorization. The Language Scale is divided into Receptive Communication (49 items) and Expressive Communication (48 items) subscales, evaluating the child's ability to comprehend and produce through verbal prompts, gestures, and visual cues. Receptive tasks involve following directions, such as pointing to named body parts or identifying pictures in a book, while expressive tasks assess naming objects, gesturing needs, or vocalizing words like responding to one's name. This structure highlights vocabulary growth and communication intent, with items calibrated to detect nuances in auditory processing and from to simple sentences. The Motor Scale includes Fine Motor (66 items) and Gross Motor (72 items) subscales, measuring precision and coordination through hands-on activities that test physical manipulation and . Fine motor tasks feature grasping small objects, drawing scribbles, or stacking blocks to evaluate pincer grip and dexterity, whereas gross motor items assess milestones like sitting unsupported, crawling, or walking independently. These age-appropriate challenges provide insights into neuromuscular development, prioritizing functional mobility over athletic prowess. The Social-Emotional Scale, a 35-item derived from the Greenspan Social-Emotional Growth Chart, assesses emotional regulation, play behaviors, and social interactions via parent ratings of typical scenarios. Items cover responses to , such as smiling at familiar faces or engaging in , to identify competencies in self-comforting, , and peer relations across developmental stages. This report-based approach complements direct observation by capturing everyday emotional dynamics in home environments. The Adaptive Behavior Scale, consisting of 241 caregiver-reported items inspired by the , evaluates practical skills in communication, daily living, and socialization through checklists of routine activities. Domains include self-feeding with utensils, dressing with assistance, or initiating social greetings, offering a comprehensive view of independence and interpersonal functioning tailored to the child's age. By focusing on real-world application, this scale underscores adaptive strengths essential for later autonomy.

Fourth Edition (2019)

Major Updates

The fourth edition of the Bayley Scales of Infant and Toddler Development (Bayley-4), released in 2019, introduced several structural and methodological enhancements over the third edition (Bayley-III) to improve assessment precision, inclusivity, and practicality. These updates were informed by recent developmental research, user feedback, and advances in assessment technology, aiming to better capture nuanced developmental performance in infants and toddlers. A key refinement in scoring involves shifting from the dichotomous (pass/fail) system of the to a polytomous scale, where items are scored as 0, 1, or 2 points to allow partial credit for emerging or approximate skills. This change enables more granular evaluation of performance, particularly for children with subtle delays or atypical development patterns. Normative data in the Bayley-4 were expanded to include greater representation of children with disabilities, notably incorporating 21 cases of children with (1.2% of the sample) into the cognitive, language, and motor norms. This addition addresses limitations in prior editions by enhancing the tool's applicability to diverse populations and reducing bias in scoring for neurodevelopmental conditions. The adaptive behavior component was updated to draw from the –Third Edition (Vineland-3), replacing the Adaptive Behavior Assessment System–Second Edition (ABAS-II) used in the Bayley-III. This revision incorporates 120 streamlined items across domains such as communication, daily living skills, , and motor skills, shortening the questionnaire while maintaining comprehensive coverage of real-world functioning. Practical administration was modernized through integration with the Q-global digital platform, which supports web-based scoring, reporting, and remote completion of the Social-Emotional and Adaptive Behavior scales. New telepractice guidelines were also developed to facilitate remote assessments, and the kit was streamlined by eliminating separate record forms and manuals, reducing materials while improving workflow efficiency. The age range was extended downward from 1 month to 16 days, allowing earlier identification of potential delays in neonates. Item content was refined for efficiency, with the retaining 81 items, the Language scale 79 items, and the Motor scale 104 items, achieved by removing redundant tasks and optimizing progression. Floor and ceiling effects were improved through adjusted basal and ceiling rules—requiring three consecutive 2-point responses for the basal and five consecutive 0-point responses for the ceiling—providing a wider score range without excessive item administration. Additionally, items were revised based on evidence from and developmental studies to ensure alignment with current understandings of early and motor development.

Current Applications and Validity

The Bayley Scales of Infant and Toddler Development, Fourth Edition (Bayley-4), serves as a primary tool for early intervention programs, particularly in identifying developmental delays among high-risk populations such as (NICU) graduates and preterm infants. It is widely applied in clinical settings to guide individualized intervention plans, including for children showing early signs of autism spectrum disorder through assessment of social-emotional and cognitive domains. Post-2020 adaptations have facilitated administration, with official guidance enabling remote completion of caregiver questionnaires for social-emotional and scales via secure digital platforms, enhancing accessibility during the . In research, the Bayley-4 supports longitudinal studies on preterm outcomes, tracking motor and to evaluate intervention efficacy. As of 2025, it is used in clinical trials for neurodevelopmental disorders such as , assessing outcomes in special populations. Reliability evidence for the Bayley-4 demonstrates strong psychometric stability across domains. Internal consistency coefficients range from 0.93 to 0.99 for the cognitive, language, and motor scales, indicating high item homogeneity. Test-retest reliability, assessed over intervals of 1-2 weeks, yields coefficients of 0.81 to 0.87 for these scales, reflecting consistent scores over short periods. Inter-rater reliability, based on independent scoring by trained examiners, ranges from 0.67 to 0.81 across subdomains, supporting dependable administration in diverse clinical contexts. Validity studies affirm the Bayley-4's alignment with established developmental constructs. is evidenced by correlations of 0.69 to 0.75 with the Bayley-III scales, confirming score comparability across editions. is supported by correlations of 0.45 to 0.79 with the Wechsler Preschool and Primary Scale of Intelligence, Fourth Edition (WPPSI-IV) in older toddlers, particularly in cognitive and language domains. derives from expert reviews and literature-based item selection, ensuring relevance to contemporary developmental milestones. studies indicate moderate correlations with later developmental outcomes. Recent research highlights the Bayley-4's performance in diverse populations, with reduced cultural biases through diverse samples. Despite these strengths, limitations persist in high-risk cohorts. Long-term predictive accuracy remains poor for extremely (ELBW) children, with cognitive outcomes often diverging significantly by school age. The instrument is not recommended for children over 42 months, as norms and items are optimized for infants and toddlers up to this age.

References

  1. [1]
    Bayley Scales Of Infant and Toddler Development - StatPearls - NCBI
    Bayley Scales of Infant and Toddler Development is an extensive formal developmental assessment tool for diagnosing developmental delays in early childhood.Definition/Introduction · Issues of Concern · Clinical Significance
  2. [2]
    Bayley-4 - Bayley Scales of Infant and Toddler Development | Fourth Edition | Pearson Assessments US
    ### Summary of Bayley Scales of Infant and Toddler Development | Fourth Edition
  3. [3]
  4. [4]
    [PDF] Norm-Referenced Assessment Tools for Children Birth to Age Five ...
    Oct 22, 2020 · The DEC Recommended Practices and IDEA require that multiple sources of information be used to determine eligibility for early intervention or.<|control11|><|separator|>
  5. [5]
    Beyond the Bayley: Neurocognitive Assessments of Development ...
    The scales are intended to assess three major areas of development: cognitive, language and motor (Bayley, 1969; 2006). Although currently the most used option ...
  6. [6]
    Bayley Scales of Infant and Toddler Development, Fourth Edition
    Aug 31, 2022 · Highly recommended for developmental, psychological, and neuropsychological studies of infants and toddlers up to 42 months old. Highly ...
  7. [7]
    Cultural adaptation of the Bayley Scales of Infant and Toddler ...
    Cultural adaptation of the Bayley Scales of Infant and Toddler Development, 3rd Edition for use in Kenyan children aged 18–36 months: A psychometric study.
  8. [8]
  9. [9]
    The Berkeley Growth Study - jstor
    (36) Bayley, N.: The California infant scale of motor development. University of California Syllabus Series, No. 259, 1936, pp. 11. (37) Wolff, L. V. ...
  10. [10]
    Nancy Bayley: Pioneer in the measurement of growth and ...
    This chapter takes a look at the lifework and contributions of Nancy Bayley to the field of psychology and the study of human growth and development.Missing: origins | Show results with:origins
  11. [11]
    to 22-month-old black, firstborn male infants on two tests of cognitive ...
    Aug 7, 2025 · The mean Bayley psychomotor score was significantly greater than that of the standardization sample at all 3 age levels. Variability on both the ...<|control11|><|separator|>
  12. [12]
    Bayley Scales of Infant Development - ScienceDirect.com
    The Bayley Scales of Infant Development (BSID) is the most widely used measure to assess neurodevelopment of very preterm (gestational age ≤ 32 weeks) and very ...
  13. [13]
    Bayley, Nancy | Encyclopedia.com
    It became famous as the Berkeley Growth Study. Her 1933 publication, The California First-year Mental Scale, was followed in 1936 by The California Infant Scale ...Missing: origins | Show results with:origins<|control11|><|separator|>
  14. [14]
  15. [15]
    Poor Predictive Validity of the Bayley Scales of Infant Development ...
    Aug 1, 2005 · Poor Predictive Validity of the Bayley Scales of Infant Development for Cognitive Function of Extremely Low Birth Weight Children at School Age ...
  16. [16]
    None
    ### Summary of Bayley-4 Standardization, Normative Samples, Reliability, and Comparisons
  17. [17]
    [PDF] Comparison of the Revised and Original Versions of the Bayley ...
    The original Bayley Scales have proven to be a sound instrument and have been frequently utilized in research studies with a variety of clinical samples, ...Missing: details | Show results with:details
  18. [18]
    Test Review: Bayley-III - LEADERSproject
    Nov 25, 2013 · The Adaptive Behavior Scale is based on the ABAS-II. The standardization sample consisted of 1350 children aged 0-71 months. Children were ...Missing: 1712 | Show results with:1712<|separator|>
  19. [19]
    Acceptability and Reliability of the Bayley Scales of Infant ... - Frontiers
    The inter-rater reliability coefficients were excellent for both the standardization sample (ICC = 0.99) and for the quality control throughout the study (ICC = ...
  20. [20]
    Taiwanese Infants' Mental and Motor Development—6–24 Months
    Aug 21, 2007 · The BSID-II is composed of the Mental Scale (178 items), the Motor ... The Bayley II was normed before the recommendation of American ...
  21. [21]
    [PDF] Developmental Trajectories of Brain Structure, Motor, and Cognitive ...
    Aug 6, 2022 · children); early cognitive function (measured by Bayley-II at 12 months) had a significant ... (178 items),. Motor (111 items), and Behavior ...
  22. [22]
    Comparison of Second and Third Editions of the Bayley Scales in ...
    Apr 30, 2018 · The Bayley Scales of Infant Development (BSID) is the most widely used developmental assessment test for infant and children. The first edition ...
  23. [23]
    Understanding the Bayley Scales of Infant Development
    Oct 27, 2025 · The Bayley Scales of Infant and Toddler Development stand as an essential tool in the early identification and intervention for developmental ...
  24. [24]
    [PDF] Early Childhood Longitudinal Study, Birth Cohort (ECLS–B ...
    SOURCE: Standardization dataset for the Bayley Scales of Infant Development, Second Edition (BSID-II), The Psychological Corporation, 1993. Valid items: 178.
  25. [25]
    A modified version of the Bayley Scales of Infant Development-II for ...
    Apr 28, 2008 · We created a modified form of the BSID-II (the BSID-M) to provide a 'purer' assessment of the general cognitive capacities in infants with Down syndrome (DS) ...Missing: overlaps | Show results with:overlaps
  26. [26]
    None
    ### Summary of Key Improvements and Changes in Bayley-III Compared to BSID-II
  27. [27]
    [PDF] ORIGINAL ARTICLE - Semantic Scholar
    cognitive (91 items), receptive (49 items) and ... Mean differences (SD), and Partial eta squared values for all subtests per Bayley-III age groups.
  28. [28]
    [PDF] III vs. Bayley™-4 - Pearson Clinical Assessment
    • 10-day age bands up to 5 months. • 1-month age bands from 6 months to 35 months. • 3-month age bands from 36–42 months. • 10-day age bands up to 6 months. • 1 ...
  29. [29]
    Using the Bayley-III to assess neurodevelopmental delay - Nature
    Feb 3, 2014 · The Bayley Scales are the most frequently used tests in infant developmental assessment. The second edition of the test, the BSID-II (1) ...<|control11|><|separator|>
  30. [30]
    Comparison of the Bayley II Mental Developmental Index and ... - NIH
    Bayley III Cognitive scores were significantly higher than Bayley II MDI scores for term and preterm toddlers combined and separately.Missing: improvements second
  31. [31]
    Relationship Between Test Scores Using the Second and ... - PubMed
    Nov 1, 2011 · Results: Bayley-III cognitive and language scores were 10 and 3 points higher than MDI scores, respectively; CB-III scores were 7 points higher.
  32. [32]
  33. [33]
    Test review: Bayley, N.(2006). Bayley scales of infant and toddler ...
    ... Cognitive scale, resulting in a total of 91 items. Forty-five of the items from the Mental scale were completely removed from the Bayley-III, whereas the ...
  34. [34]
  35. [35]
  36. [36]
    Bayley-4: New Product Announcement - Pearson Support
    A reduced number of items and improved workflow. The shortened adaptive behavior content from the Vineland-3, saving the caregiver significant time. Updated ...
  37. [37]
    Does the Degree of Prematurity Relate to the Bayley-4 Scores ...
    Nov 8, 2023 · The literature on children born prematurely has consistently shown that full-term babies outperform preterm babies by about 12 IQ points, even when tested as ...Missing: telehealth | Show results with:telehealth
  38. [38]
  39. [39]
  40. [40]
    Neonatal Care Unit Interventions on Preterm Development - MDPI
    Jun 2, 2023 · Early intervention programs focusing on uni-modal sensory input have demonstrated specific benefits on neurobehavioral functions in the preterm ...
  41. [41]
  42. [42]
    None
    ### Reliability and Validity of Bayley-4
  43. [43]
    The predictive validity of Bayley Scales of Infant and Toddler ...
    The study examined the predictive validity of the Bayley Scales of Infant and Toddler Development (BSID) 3 rd edition on later tests of general abilities.<|separator|>
  44. [44]
    The stability of the Bayley scales in early childhood and its ...
    Strengths of our study is the high-quality measurements with excellent inter-rater agreement [8] reducing random measurement errors, and the large sample size ...
  45. [45]
  46. [46]
    insights from a comparative study of the Bayley Scales (BSID-III) and ...
    Mar 1, 2025 · ... early intervention programs aimed at promoting optimal development ... This finding aligns with the idea that early developmental ...
  47. [47]
    Bayley Scales of Infant and Toddler Development | Fourth Edition
    ### Summary of Bayley-4: Current Applications, Telehealth Adaptations, Reliability, and Validity