When people talk with each other, they move—they may tilt their head towards each other, lean forward, shake their head for “no” or shrug a shoulder to express disinterest or lack of comprehension. This multimodal body-language embodied perception of a conversation requires coordination between the two interlocutors’ words and bodily actions to create an effective interactive process (Mondada,
2016). As such, each speaker’s verbal expression triggers the other’s corresponding motor action and vice versa; thus, language, gesture, gaze, head movements, facial expressions, body posture, and body movements cojoin to create mutual understanding between the interlocutors (Garrod & Pickering,
2009; Pickering & Garrod,
2004). Overall, children create shared knowledge known as “common ground” during a conversation, and they also coordinate their action in time and space to create “joint action” (Okabayashi,
2017). Both common ground (CG) and joint action (JA) seem to be crucial aspects of effective peer interactive conversation, but their cojoined contribution to peer-to-peer communication has been overlooked. Little empirical attention has focused on the mechanisms by which each interlocutor’s own bodily movements and the dyad’s ability to synchronize their motor coordination (JA) may contribute to the effective creation of CG during peer dialogue.
Moreover, little is known about the cojoined contribution of these variables—each interlocutor’s individual’s motor functioning together with the partners’ CG and JA—for successful peer communication among children and adolescents with autism spectrum disorder (ASD), who possess major challenges in peer interaction as a defining criterion (American Psychiatric Association,
2022). Therefore, this study aimed to empirically explore such language-motor links for this highly relevant population. Autism is characterized by challenges in social communication and restricted behaviors (
DSM-V-TR, American Psychiatric Association,
2022) as well as by documented wide-ranging motor and motor coordination challenges (Bhat,
2020,
2021; Reynolds et al.,
2022; Wang et al.,
2022).
Common Ground: Characteristics and Development in TD and ASD
CG refers to the shared meaning constructed by two interlocutors during a conversation, which is built upon their use of various speech acts such as an affirmation to an initiation, question asking, and correction of misunderstandings (Clark & Krych,
2004; Clark & Wilkes-Gibbs,
1986). Inasmuch as CG involves establishing shared knowledge and mutual understanding between speakers, its creation through conversational grounding makes everyday interactions more efficient. Difficulties in CG can result in lengthy, exhausting conversations that demand substantial effort to correct communication breakdowns (Clark & Wilkes-Gibbs,
1986; Okabayashi,
2017; Schuh et al.,
2016).
Creating CG between two interlocutors necessitates the coordination of language, cognitive, and motor skills, and it involves recalling memory, delaying information, responding to gestures, and grasping the conversational context (Nilsen & Graham,
2009). In TD children, CG gradually evolves from early childhood (age 5 years) to preadolescence (age of 12 years) when children can initiate conversation corrections, offer clarifications, provide descriptions or gestures, and consider their partner’s perspective when facing gaps in their shared knowledge (Khu et al.,
2020; Nadig & Sedivy,
2002).
Referential communication tasks are common procedures for CG evaluation (Abbot-Smith, et al.,
2020; Bovet et al.,
2024; Malkin et al.
2018a; Nilsen & Graham,
2009). They involve information exchanges where one speaker describes an unseen object, process, or image to another while relying on the ability to maintain and update CG representations with the partner (Schuh et al.,
2016). Clark and Wilkes-Gibbs’s (
1986) well-accepted referential communication task instructs pairs of TD adults to convey information (hidden from one partner) on geometric configurations’ arrangement, using tangram cards over six turns. Findings showed that neurotypical adults exhibited the benefits of CG creation over the course of this referential task by using the shortening effect, which reduces partners’ number of words and time needed to communicate and share knowledge effectively (Clark & Krych,
2004).
In autistic individuals, studies using various referential methods (such as tangram cards, computer games, and board games) have consistently shown less efficient CG creation compared to TD peers (Eigsti et al.,
2011; Fukumura,
2016; Malkin et al.,
2018b). For instance, in Nadig et al.’s (
2009) referential communication task involving the creation of shared information and a guessing game, one participant was asked to give the partner a clue about a “secret” object hidden from the partner’s view. Autistic children and adolescents (ages 9–14 years) were found to use fewer informative terms and to include more unnecessary details when identifying objects, compared to their TD peers.
Referential communication shortening tasks were found to correlate with age and symptom severity in ASD; thus, older participants with less severe autism traits exhibited better referential shortening (De Marchena & Eigsti,
2016), calling for further investigation of the differences between ASD and TD youngsters’ developmental CG trajectories. Indeed, in the few available empirical studies on age trajectories for CG development in ASD, its creation was found to improve with age, with differences observed between younger (school-age) and older (adolescent) groups (Arnold et al.,
2009; Fukumura,
2016). In Arnold et al.’s (
2009) study, when describing a cartoon scenario to an adult partner, autistic children ages 9–12 years produced fewer references and used significantly more over-informative language compared to their TD age-mates; however, these group differences were not present in adolescents ages 13–17 years while communicating with an adult. Fukumura (
2016) reported a developmental trend for CG among autistic participants, where younger children (6–10 years) tended to provide more unnecessary or missing information during referential tasks compared to older ASD participants (11–16 years) and to TD participants of all ages (6–16 years). This indicated that understanding of CG with an adult partner evolves along development for individuals with ASD. Other research on adults found that individuals with autism can establish shared knowledge but tend to do so more slowly and less efficiently than TD adults (Heasman & Gillespie,
2019; Nadig et al.,
2015; Wadge et al.,
2019).
In sum, research to date is quite limited on the process of CG creation during referential tasks and its development with age in both autistic and non-autistic populations (Arnold et al.,
2009; Fukumura,
2016). Moreover, the available CG research in both groups has mainly focused on child–adult or adult-adult communication rather than children’s peer-to-peer interactions, which pose greater social demands especially for individuals with autism because adults tend to help mediate the interaction (De Marchena & Eigst,
2016). Thus, the performance of CG in autism during peer-to-peer interaction may be different and more challenging from the formation of CG with an adult as examined in most previous studies. Furthermore, prior research on CG growth during children’s peer-to-peer tasks and its developmental trajectory in ASD and TD has not yet sufficiently examined CG’s possible links with motor skills and JA synchronization in the context of the dyadic peer interaction. Overall, our study's novel contribution is its examination of the process of CG creation over 6 turns, in the context of peer-to-peer interaction, in children and adolescents, comparing ASD and TD, as well as the understanding of language-CG links with motor skills and synchronization.
Motor Ability and Joint Action in TD and ASD
Although motor challenges are not among the formal criteria for diagnosing autism, recent reviews have underscored this population’s profound difficulties in gross-motor coordination, locomotor skills, fine-motor coordination, gait and posture, imitation, pantomime, and motor planning throughout the developmental stages from preschool to adulthood (Bhat,
2020,
2021; Licari et al.,
2020; Zampella et al.,
2020). Motor irregularities are already evidenced during infancy and toddlerhood by autistic individuals’ delayed acquisition of motor milestones. Later, despite a slight reduction in the motor dysfunction of autistic individuals with age (Fournier et al.,
2010), motor challenges remain apparent in adolescence (Estrugo et al.,
2023) and adulthood (Linke et al.,
2020).
Beyond the individual’s own fundamental motor abilities, the child’s capacity to coordinate movements in time and space with a partner to achieve a shared goal—JA synchronization with a peer—underlies every human interaction (Knoblich & Sebanz,
2008). JA is built upon the individual’s ability to track, interpret, and predict their partner’s social-communicational and motoric intentions (Azaad & Sebanz,
2023). These capabilities are challenging for autistic individuals, as seen in several recent reviews that demonstrated significantly better JA performance in TD compared to autistic age-mates, alongside delayed maturation of JA in ASD (Cerullo et al.,
2021; McNaughton & Redcay,
2020). A recent study supported this TD group advantage over the ASD group for dyadic motor coordination among children ages 6–16 years while performing JA tasks like shared walking and playing imaginary soccer with a same-age peer partner (Bar Yehuda & Bauminger-Zviely,
2022).
Language-Movement Links: Common Ground, Motor Skills, and Joint Action
As noted, effective communication in a conversation activates both motor and language coordination skills, which contribute to the conversation’s flow and reciprocity. For example, the interplay between lexical, postural, and gestural mental representations is important to reach shared understanding between interlocuters (Galantucci & Sebanz,
2009; Garrod & Pickering,
2009). Each partner in the conversation coordinates talking with their own bodily movements but also coordinates their talk and motor actions with the other person’s timing, rhythm, and spatial orientation (Mondada,
2016). During a conversation, partners often employ joint motor and language actions, and they may reach a higher level of coordination by imitating and synchronizing their voice volume, gestures, gaze direction, and body posture (Okabayashi,
2017; Wynn et al.,
2018). This motor-language coordination may impact partners' ability to cooperate, establish social relationships, and empathize with each other (Garrod & Pickering,
2009; Pickering & Garrod,
2004).
CG is a pragmatic ability, which denotes the communicative use of language for an efficient interaction. Pragmatic language encompasses a diverse range of abilities, such as mastering reciprocal conversational skills like turn-taking and adapting word choice according to different conversational contexts (De Marchena & Eigsti,
2016). Deficiencies and delays in both motor and language-pragmatic functioning were documented for autism. For instance, caregivers of almost 80% of autistic toddlers (mean age: 18.7 months) raised concerns about motor and language development before their toddler’s diagnosis (Herkert et al.,
2023). Likewise, caregivers reported that their autistic toddlers (mean age: 2.3 years) presented significantly delayed attainment of both motor and speech developmental milestones (Matson et al.,
2013). At later ages, the conversations of children and adolescents with autism (ages 9–16 years) showed both reduced movement (e.g., tempo similarity, coordination, simultaneous movement, posture congruence) as well as lower verbal synchrony (e.g., balanced and reciprocal conversations, coordinated pauses in speech) compared to same-age TD children (Zampella et al.,
2020).
Moreover, these two skill areas have been linked in some prior research. Caregiver-reported motor and language-pragmatical performance by autistic children and adolescents (2–17 years) highlighted a positive association between fine-motor performance and expressive and receptive language skills (Mody et al.,
2017). Also, autistic children and adolescents (22 months to 16 years) who participated in motor-based interventions demonstrated improvements in language proficiency skills such as social responsivity, word production, and linguistic flexibility (Odeh et al.,
2022). Though the link between motor and pragmatic mechanisms in autism may postulate an interesting perspective on their peer-interaction functioning, the nature of the connection between JA and CG and the contribution of motor functioning to pragmatic coordination has not yet been sufficiently explored.
Current Study
Using growth curve analyses for both ASD and TD groups, this study examined peer dyads’ process of CG creation evolving over a six-turn task, as reflected in dyads’ reduction of the number of words and time duration needed to successfully communicate referential information. We also investigated group differences (ASD, TD) and age differences (early childhood, preadolescence, adolescence) in dyads’ CG creation outcomes—by calculating word count and turn duration at the end of the task (the mean of the fifth and sixth turns) while controlling for baseline CG level (mean of the first and second turns). Lastly, we examined CG’s links with individual motor skills and dyadic JA synchronization and examined these motor variables’ contribution to CG creation. Although JA includes activation of both dyadic mirroring activities (imitating another’s movement) and dyadic complementing activities (reacting to and continuing another’s movement), we opted to use only complementary JA tasks for this study because both CG tasks and complementing JA tasks involve one partner’s action (describing a card, kicking an imaginary soccer ball) that elicits a complementary reaction (placing a card, catching an imaginary ball) from the partner.
Regarding growth in the efficacy of CG creation, we hypothesized a decrease from the first turn to the final turn in peer dyads’ number of words and turn duration in both research groups. We also expected better CG creation outcomes (at Turns 5–6, controlling for baseline) in the TD group than the ASD group, marked by shorter duration and reduced word count (e.g., De Marchena & Eigsti,
2016; Fukumura,
2016; Malkin et al.,
2018a). Furthermore, we expected that both study groups would demonstrate better CG performance with age (e.g., Dahlgren & Sandberg,
2008; De Marchena & Eigsti,
2016; Fukumura,
2016).
In addition, expecting that individual motor skills and dyadic motor coordination ability would enhance dyads’ pragmatic coordination during social interactions, we hypothesized that child's better individual motor abilities and JA skills would correlate with lower CG scores (indicating lower word count and shorter turn duration, namely better CG efficiency), in both study groups, and we expected that both motor skills and JA would significantly positively contribute to the CG creation process (Zampella et al.,
2020). Finally, considering that CG’s development is influenced by age and attains maturity during adolescence (De Marchena & Eigsti,
2016), we hypothesized that the link between JA and CG would be lower as a function of age increase.