Top

Open Access 23-04-2025 | REVIEW

A Systematic Review of Self-report Measures Used to Assess Compassion in Children and Young People

Auteurs: Jasmine Hubbard, Tamara Leeuwerik, Clara Strauss

Gepubliceerd in: Mindfulness

Abstract

Objectives

Scientific interest in compassion has increased in recent years. However, research highlights issues related to the measurement of compassion in children and young people (CYP). This systematic review aimed to describe and critically appraise measures of self- and other-compassion developed and/or validated with CYP.

Method

Medline, PsycInfo and Web of Science were searched from inception to 31 st July 2024 to identify English language, peer-reviewed psychometric papers on the development or validation of compassion measures for use with CYP. The measures were critically reviewed and rated for quality.

Results

Nineteen articles were included that outlined psychometric properties of eight compassion measures: Compassionate Engagement and Actions Scales—Adolescence (CEAS-A); Compassion Scale – Adolescence (CS-A); Compassion Scale—Child (CS-C); Self-Compassion Scale (SCS); Self-Compassion Scale – Child (SCS-C); Self-Compassion Scale—Youth (SCS-Y); Self-Compassionate Reactions Scale – Child (SCRS-C); Shortened Self-Compassion Scale—Adolescence (S-SCS-A). Quality ratings ranged from 2 to 9 out of 14. The CEAS-A received the highest rating (9), followed by the SCS-Y (8), SCS (7) and CS-C (7). No measure captured all five compassion elements and evidence for test–retest reliability and floor and ceiling effects was largely absent. Most papers assessed self-compassion and used (translated, adapted versions of) the SCS and SCS-Y.

Conclusions

This review suggests there are no psychometrically robust measures that comprehensively assess self- and other- compassion in CYP, potentially impeding research. The psychometric quality of existing compassion measures for CYP should be improved, and/or a new measure should be developed, and include the measurement of compassion for others.

Preregistration

This study is not preregistered.

Supplementary file1 (DOCX 60 KB)

Supplementary Information

The online version contains supplementary material available at https://doi.org/10.1007/s12671-025-02569-w.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

There has been increased scientific interest in compassion in recent years, with recognition of its importance in both education (Compassion in Education Foundation, 2016) and healthcare (Department of Health, 2013); there is growing evidence to suggest that compassion-focused training and intervention for teachers and/or pupils improves student wellbeing, cognitive functioning, resilience and peer relationships (e.g. Kappelmayer et al., 2022; Seekis et al., 2023), teacher stress and wellbeing (e.g. Matos et al., 2022). In healthcare, studies have shown that (training in) compassionate care reduces stress healthcare professionals’ stress and improves patient outcomes and experience (e.g. Sinclair et al., 2021).

Compassion is considered an evolutionary emotion, which is reproductively beneficial and has evolved as a crucial element of the caregiving system to nurture and protect offspring (Goetz et al., 2010). It is generally considered to be equivalent for others (other- compassion) and self (Feldman & Kuyken, 2011; Gilbert, 2014). Whilst understanding of this relationship is more limited in children and young people (CYP), findings suggest these orientations of compassion are overlapping constructs in CYP (Cunha et al., 2023). Several definitions exist in the literature, which reflect that compassion can be directed inward toward the self, or outward towards others (Roeser et al., 2018). Gilbert’s (2014) definition, “the sensitivity to suffering in self and others, with a commitment to try to alleviate and prevent it” (p. 19), encapsulates both. While Gilbert (2014) adopts an evolutionary, biopsychosocial perspective on compassion, Neff’s (2003a) conceptualisation draws on Buddhist philosophy and focuses on self-compassion; her conceptualisation of self-compassion involves three main components: self-kindness (being kind and understanding toward oneself rather than self-critical in instances of pain or failure), common humanity (perceiving one’s experiences as part of the larger human experience rather than as isolating), and mindfulness (holding painful thoughts and feelings in balanced awareness rather than over-identifying with them. These differences capture the wide scope of compassion research, which encapsulates self-compassion and interpersonal compassion and explores their associations with professional and family caregiving (e.g. Liao et al., 2022; Singh et al., 2018), social relationships (e.g. Lathren et al., 2021), and psychological wellbeing, (e.g. Zessin et al., 2015), including in the context of interventions that aim to improve compassion (e.g. Kirby et al., 2017).

Current understanding of the development of compassion is informed by research investigating the development of empathy-related responding and prosocial behaviour (Eisenberg, 2017; Roeser et al., 2018). The beginnings of compassion are thought to exist from the first year of life, shown through infants’ mirroring others’ emotions, e.g. crying when others cry (Gilbert, 2015), followed by children starting to develop rudimentary empathy, expressing concern and prosocial behaviour towards others, e.g. comforting their caregivers, in the second year of life (Eisenberg, 2000), associated with (further) self-other differentiation. In early childhood (ages 3–5 years), improving language and cognitive skills support the development of the capacity to understand others’ emotions and intentions, with children starting to show greater empathy and moral reasoning. Prosocial behaviour, particularly in familiar social environments, becomes more frequent and complex. Perspective-taking skills become more firmly established around six years of age and continue developing into adolescence. These skills are important for the development of compassion as they allow children to understand the emotions of others (Selman, 1980). Peer relationships and social awareness deepen during this time. Prosocial behavior becomes more deliberate and guided by moral reasoning and social norms (Eisenberg, 2000).

During adolescence, perspective taking skills approach adult levels. Compassion begins to mature during adolescence, as a more complex understanding of the self and others develops and as the ability to view experiences of the self and others from a third-person or societal perspective strengthens (Bengtsson et al., 2016; Selman, 1980). In line with the biopsychosocial perspective on compassion, individual differences in prosocial development are associated with temperamental factors like sociability, emotionality and shyness (including via their impact on emotion regulation) (Roeser et al., 2018). Family influences on prosocial development include attachment security, parenting styles and practices and parental affection, while broader societal influences include schools, communities, faith-based institutions (Roeser et al., 2018). Cultural influences on (expression of) compassion have also been noted (Koopmann-Holm et al., 2021).

Adolescence is a turbulent time as young people explore their identity and face societal challenges (e.g. education, future career), in the context of significant neurological restructuring. It is associated with increased vulnerability to mental health difficulties (Bluth & Clepper-Faith, 2023). Neff (2003a) proposed that self-compassion can decrease in adolescence due to increased self-reflection in relation to ideals and standards that may fuel self-criticism and feelings of isolation, which is supported by some studies (e.g. Bengtsson et al., 2016). Research also suggests gender differences, as adolescent females reported greater compassion for others and lower self-compassion than adolescent males (Bengtsson et al., 2016). Some research has shown age interacts with gender as older adolescent females report lower self-compassion than younger adolescent females and adolescent males (e.g. Bluth et al., 2017).

Whilst most compassion research has been conducted with adults (18 + years of age), research with CYP (defined as younger than 18 years of age) is growing and reflects findings with adults, suggesting self-compassion is important for the psychological and social wellbeing of CYP. For example, self-compassion in CYP has been associated with enhanced wellbeing, happiness, mindfulness, resilience, social connectedness, and mastery-oriented goals, in addition to reduced anxiety, depression, and performance-oriented goals (Marsh et al., 2018; Neff & McGehee, 2010; Neff et al., 2021). Self-compassion has also been shown to negatively correlate with peer victimization (Hatchel et al., 2019), and to moderate the relationship between peer victimization and self-harm (Jiang et al., 2016). Whilst research on other-compassion is more limited for CYP than adults, a recent study with adolescents showed positive associations with emotion regulation, wellbeing, and mindfulness (Heidary et al., 2022).

A key challenge in compassion research is lack of consensus in its definition and measurement (Gilbert & Van Gordon, 2023; Neff, 2023). The definition by Gilbert, noted above, has been operationalised in the Compassionate Engagement and Action Scales (CEAS) (Gilbert et al., 2017), which assess compassionate engagement and action in relation to self-compassion, compassion for others, and the ability to receive compassion from others, respectively. Neff’s (Neff, 2003b) Self-Compassion Scale (SCS), on the other hand, the most widely used measure of self-compassion (Neff, 2023), operationalises her definition and measures self-compassion across three overlapping bipolar domains (self-kindness vs self-judgment, common humanity vs isolation, mindfulness vs over-identification) that, together, capture a bipolar continuum from uncompassionate to compassionate self-responding when distressed (Neff, 2023). These differences in definition and measurement pose a significant challenge for compassion research (Fig. 1).

In response to this challenge, Strauss et al. (2016) consolidated eight existing, major, published conceptualizations and definitions in the literature (spanning Buddhist and Western psychological perspectives), including those by Neff and Gilbert cited above, into one multi-faceted definition of compassion as a cognitive, affective, and behavioral process consisting of five elements: (1) recognizing suffering; (2) understanding the universality of human suffering; (3) feeling empathy for the person suffering and connecting with the distress (emotional resonance); (4) tolerating uncomfortable feelings aroused in response to the suffering person (e.g., distress, anger, fear); (5) motivation to act/acting to alleviate suffering. While each element was captured across the eight published definitions, no existing definition explicitly captured all five elements. Gu et al. (2017) empirically examined the underlying conceptual structure of compassion by conducting a series of exploratory (Study 2) and confirmatory (Study 3) factor analyses with student samples, using items from existing self-report compassion measures reviewed for face validity by 15 mindfulness experts by training or experience, who also generated additional items to ensure a sufficient item pool for factor analysis (54 from four existing measures, 26 new items) that captured the five elements of the definition by Strauss et al. (2016) (Study 1). Findings supported the five-factor hierarchical structure of compassion, consistent with the definition by Strauss et al. (2016). This definition of compassion is therefore both grounded in theory and empirically supported and will be used throughout this review, to assess the content validity of measures of compassion developed and/or validated with CYP. Strauss et al. (2016) systematically reviewed nine self-report compassion measures for adults, finding that no measure comprehensively measured compassion with acceptable levels of reliability and validity. This highlighted a significant barrier to compassion research with adults and suggests similar issues may exist for measures for CYP.

Compassion research with CYP (i.e. younger than 18 years of age) is more limited than research with adults (18 years old or older), and there is particularly little research on other-compassion in CYP. This may be due to the limited validated, psychometrically robust self-report compassion measures for CYP (Neff et al., 2021) as research has mostly been conducted with older adolescents using measures developed for adults (Self-Compassion Scale [SCS]; Neff, 2003b; SCS-Short Form; Raes et al., 2011) (Neff et al., 2021). These self-report measures may use language that is too complex for CYP and/or may not capture the nuances of how CYP experience and express self-compassion and compassion for others and are therefore not necessarily suitable for assessing compassion in CYP. Measures that do not fully capture the nature of compassion, as defined by Strauss et al. (2016), and that have other psychometric weaknesses are likely to hinder research findings, thereby limiting advancements in research and practice with CYP (e.g. limiting the capacity to accurately evaluate the effectiveness of compassion-based interventions for CYP).

Meta-analyses of compassion-based intervention (CBI) randomized controlled trials (RCTs) in adults indicated that compassion is a skill that potentially can be cultivated (Ferrari et al., 2019; Kirby et al., 2017), although such findings are limited by the psychometric weaknesses of compassion measures already noted. Moreover, these meta-analyses highlighted the lack of RCTs assessing CBIs in CYP. RCTs evaluating CBIs with CYP are therefore needed, and this should include examination of potential mechanisms of action, including compassion. This further highlights the need for valid and reliable compassion measures suitable for CYP, to advance our understanding of the effectiveness and mechanisms of change of CBIs.

To the authors’ knowledge, no systematic review to date has examined the psychometric properties of self-report compassion measures developed and/or validated with CYP. This systematic review, therefore, aimed to describe and critically appraise the psychometric properties of self-report self- and other-compassion measures developed and/or validated with CYP, to facilitate understanding of their suitability for use in research and clinical practice and to identify areas for future research.

Method

Eligibility Criteria

To be selected, papers had to meet the following inclusion criteria: (i) peer-reviewed article, (ii) published in English, (iii) psychometric paper outlining the development and/or validation of a self-report measure, (iv) description of the measure is related to compassion or self-compassion, (v) majority of participants under 18. The following exclusion criteria were applied: (i) measure did not specifically assess global compassion (e.g., body compassion), (ii) exclusively adult sample, (iii) unclear if the majority of participants were under 18, (iv) non-questionnaire measure of compassion, or subscale used. Therefore, to be selected, (one of) the aim(s) of the paper had to develop and/or validate a compassion measure for CYP. In line with best practice guidance by Boateng et al. (2018), papers that validated existing self-report measures with CYP were expected to include evaluation of factor structure, (aspects of) reliability and validity. Papers that included a compassion measure, e.g. to assess the association between compassion in CYP and other variables, but did not aim to evaluate its measurement properties and/or solely reported internal consistency, were excluded.

Search Strategy

Electronic databases Medline, Psychinfo and Web of Science were searched from inception to 31 st July 2024 to identify relevant papers. Abstracts were searched using the following search terms: (compassion* OR self-compassion*) AND (measure* OR scale* OR questionnaire) AND (“young people” OR youth OR child* OR adolesce* OR teen*) AND (valid* OR reliab* or psychometric* OR “factor analys*”).

All records from the electronic search were imported to RefWorks (n.d.). After removing duplicates, titles and abstracts of remaining records were screened against the eligibility criteria. If eligibility was not clear from the title/abstract, the full-text paper was sought for retrieval and, if retrievable, read to determine eligibility. The reference lists of selected papers were hand-searched when a paper referred to additional scales and potentially relevant papers were retrieved and screened as set out above. The search and paper identification conducted by the first author were independently replicated by the second author for 80% of records, using the provided search string across the three electronic databases set out above. Records were screened against the eligibility criteria, discussed beforehand to ensure shared understanding. There was 100% agreement on selected papers across the electronic and hand search. Therefore, no reconciliation of differences was required.

Assessment of Quality

The quality of measures was reviewed and rated based on adapted quality criteria for health status measures by Terwee et al. (2007) and Barker et al. (2002) recommendations for psychological measures (Strauss et al., 2016). In addition to Terwee et al. (2007), the factor structure (i.e., structural validity) was also evaluated. The following measurement properties were evaluated:

(1)

Content validity: The extent to which compassion was comprehensively sampled by questionnaire items. This was evaluated against the five-element definition of compassion put forward by Strauss et al. (2016) (i.e. recognising suffering (R), understanding the universality of suffering (U), Feeling for the person suffering (F), tolerating uncomfortable feelings (T), acting/being motivated to act to alleviate suffering (A)), rather than the definition by the measure’s authors. The requirement for both experts by experience and training to be consulted on the items (see Table 1) reflects that expert consultation can enhance the robustness of content validity and ensure that items adequately capture all facets of compassion (Boateng et al., 2018).

(2)

Factor structure (structural validity): Whether the factor structure proposed by the authors of the original measure, reflecting how items are combined into (sub-scale and/or total), was examined and supported.

(3)

Internal consistency: The extent to which items in a (sub)scale are inter-correlated, thereby measuring the same construct.

(4)

Test–retest reliability: The extent to which participants' performance is repeatable, i.e., how consistent scores are across time.

(5)

Construct validity: The extent to which scores on the compassion measure relate to other measures consistent with theoretically derived hypotheses about the constructs being measured.

(6)

Floor and ceiling effects: The number of respondents obtaining the highest or lowest possible scores. The presence of floor or ceiling effects indicates limited content validity (Terwee et al., 2007).

(7)

Interpretability: The extent to which qualitative meaning can be attached to quantitative scores or how differences in scores on the measure can be interpreted.

Terwee et al. (2007) also provide criteria for criterion validity and responsiveness.

Table 1

Quality criteria for rating measures

Property	Definition	Quality Criteria
1. Content validity	The extent to which the domain of interest is comprehensively sampled by questionnaire items. Strauss et al.’s (2016) definition of compassion was considered, rather than the author of the measure’s definition. Components of compassion are referred to as follows: Recognising (R) = recognising suffering Universality (U) = understanding the universality of suffering Feeling (F) = feeling for the person suffering Tolerating (T) = tolerating uncomfortable feelings Acting (A) = acting/being motivated to act to alleviate suffering	These criteria were based on Terwee et al. (2007) 2: Five elements of compassion captured by items AND items generated in consultation with both experts and young people; 1: Five elements of compassion captured by items OR items generated in consultation with both experts and young people; 0: Five elements of compassion not captured AND items not generated in consultation with both experts and young people
2. Factor structure	Whether the factor structure for the compassion measure is examined and supported	These criteria were additional to those proposed by Terwee et al. (2007) and based on the criterion proposed by Strauss et al. (2016). Exploratory structural equation modelling (ESEM) and Principal Component Analysis (PCA) were also added to these criteria to aid rating. The bifactor-ESEM framework has been developed to provide more precise psychometric examination of measures (Morin et al., 2016) and is a combination of Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA) (Morin et al., 2013). PCA has a greater emphasis on data reduction than interpretation and EFA and CFA are therefore recommended to examine the factor structure (Alavi et al., 2020) 2: EFA followed by CFA and conducted in independent samples OR CFA conducted if factor structure was theoretically proposed previously OR ESEM AND factor analyses support the proposed factor structure; 1: EFA conducted without CFA and EFA supports the factor structure; 0: Factor analysis not conducted OR conducted but does not support the proposed factor structure OR PCA conducted
3. Internal consistency	The extent to which items in a (sub)scale are inter-correlated, thereby measuring the same construct	These criteria were based on Terwee et al. (2007) 2: Factor analyses conducted on an adequate sample size (7 * number of items AND N > 100) AND Cronbach’s alpha for each identified factor between 0.70 and 0.95; 1: Factor analyses not conducted on adequate sample size OR Cronbach’s alpha for each identified factor <.70 OR > 0.95; 0: No information on internal consistency George and Mallery’s (2003) rules of thumb for internal consistency are used when describing internal consistency in text: α < 0.50 unacceptable, 0.50 ≤ α < 0.60 poor, 0.60 ≤ α < 0.70 questionable, 0.70 ≤ α, < 0.80 acceptable, 0.80 ≤ α < 0.90 good, and α ≥ 0.90 excellent
4. Test–retest reliability	The extent to which participants'performance is repeatable, i.e., how consistent scores are across time	These criteria were based on Terwee et al. (2007). However, as these criteria did not include Pearson’s r, Barker et al.’s (2002) guidance was also drawn on 2: ICC OR weighted kappa ≥ 0.70; 1: ICC OR weighted kappa < 0.70 OR r ≥ 0.70; 0: r < 0.70 or no information on test–retest reliability
5. Construct validity	The extent to which scores on a measure relate to other measures in a way that is consistent with theoretically derived hypotheses about the constructs being measured	These criteria were based on Terwee et al. (2007). However, as Terwee et al. (2007) do not take into account the strength of these correlations, Barker et al’s (2002) guidance was drawn on in relation to this 2: Specific hypotheses given AND at least 75% of results in line with these hypotheses AND a minimum of two correlations of r ≥ 0.50 to demonstrate convergent validity; 1: Specific hypotheses given AND less than 75% of results in line with these hypotheses AND/OR less than two correlations of r ≥ 0.50 to demonstrate convergent validity OR other clear methodological weakness; 0: No specific hypotheses given OR no information on construct validity Cohen’s (1988) benchmarks for effect sizes are used when describing the strength of correlations in text: r = 0.1–0.3 were considered small, r = 0.3–0.5 were considered medium, r = over 0.5 were considered large
6. Floor/ceiling effects	The number of respondents obtaining the highest or lowest possible scores	These criteria were based on Terwee et al. (2007) and adapted to be more specific regarding whether floor/ceiling effects were specifically examined to aid rating 2: Floor/ceiling effects specifically referenced AND ≤ 15% of the respondents achieved the highest or lowest possible scores; 1: Floor/ceiling effects specifically referenced AND > 15% of the respondents achieved the highest or lowest possible scores OR floor/ceiling effects not specifically referenced but data indicates ≤ 15% of the respondents achieved the highest or lowest possible scores; 0: No information on floor/ceiling effects
7. Interpretability	The extent to which qualitative meaning can be attached to quantitative scores or how differences in scores on the measure can be interpreted	These criteria were based on Terwee et al. (2007) with some adaptations in line with Strauss et al. (2016). Terwee et al. (2007) require minimal important change to be defined. However, as this was not considered relevant to the measures in this review, consideration was instead given to how scale scores might be interpreted 2: Mean and standard deviation scores provided for at least four relevant subgroups AND indication of how scale scores might be interpreted; 1: Mean and standard deviation scores provided for less than four relevant subgroups OR no indication of how scale scores might be interpreted provided; 0: No information on interpretability

However, criterion validity cannot be assessed in the absence of a gold-standard compassion measure. Furthermore, data was typically not available for responsiveness. Terwee et al. (2007) suggests assigning positive (+), indeterminate (?), or negative (−) ratings, or a rating of 0 where no information regarding the relevant criteria is provided. However, to make the findings of this review more directly comparable to the review of adult measures of compassion (Strauss et al., 2016), measures were given a rating of 2 if there was evidence for the criterion being fully met, 1 if the criterion was partially met, and 0 if the criterion was not met or if no relevant data was provided, Quality criteria are outlined in Table 1.

Quality ratings were then collated to give each included measure an overall rating. The total possible rating was 14. Where multiple papers evaluated the same measure, the following scoring criteria were applied when calculating the overall quality rating: a score of 2 if at least one study provided evidence that met the 2 criterion and no studies met a 1 or a 0 criterion that indicated evidence against the criterion (i.e. any 1 or 0 ratings would be for lack of evidence, not evidence against the criterion); a score of 1 if at least one study provided evidence that met the 1 criterion and no studies met a 0 criterion that indicated significant evidence against the criterion (i.e. not simply indicating lack of evidence), or where the “2” criterion above was not met and the criterion for “0” below was also not met; a score of 0 if any study met a 0 criterion indicating significant evidence against or all studies met a 0 criterion indicating lack of evidence.

Two authors independently rated the measures against these criteria, following initial discussion of the (papers on) these criteria to ensure shared understanding; any discrepancies were resolved collectively. Cohen’s kappa, calculated across the 19 selected papers and 7 properties (i.e. 133 ratings in total, ratings 0–2), was 0.94 (SE = 0.03, 95% CI [0.89, 0.99]), indicating almost perfect agreement (Landis & Koch, 1977). Where applicable, ratings across papers (i.e. for the SCS, SCS-Y, CS-C) were summated independently by the first and second author and subsequently discussed to reach consensus.

For papers involving translation, translation processes were evaluated broadly in line with WHO (Ustun et al., 2010) and Sousa and Rojjanasrirat (2010) best practice guidance, which include having a bilingual and monolingual (group of) translator(s), overseen by (a) bilingual subject-field expert(s) closely familiar with the measure and associated construct: (1) forward translation by a qualified translator(s) familiar with the construct and measure, supported by the bilingual experts. The aim is conceptual rather than word-for-word translation; (2) review of the translation by bilingual experts, followed by review of the translation by the monolingual group (speaking the target language). Discussion and potential redraft of the translation by the bilingual group; (3) Back-translation by independent professional translator(s) not involved in forward-translation who was blind to the original version. The bilingual group compares back-translation to the original for conceptual equivalence, resolving any differences, and; (4) pre-testing and cognitive interviewing—at least 10 respondents from the target population were invited to review the wording of the questionnaire.

Results

Selected Studies

After removing duplicates (n = 110), titles and abstracts of 149 remaining records were screened against the eligibility criteria. Subsequently, the full text of 26 papers was read to determine eligibility, resulting in 18 selected papers. The hand search resulted in one additional paper Fig. 1. The 19 studies selected for the review examined eight compassion measures. Five measured self-compassion, three (also) measured other-compassion, one (also) measured receiving compassion from others, and one (also) measured other-compassion for other living things. Three papers developed a compassion measure for CYP (Self-Compassion-Scale-Youth (SCS-Y), Compassion Scale – Child (CS-C), Self-Compassionate Reactions Scale – Child (SCRS-C)), six papers examined (adapted (parts of)) the adult SCS (n = 5) and CEAS (n = 1) for use with CYP, nine papers translated (and adapted) the SCS (n = 2), SCS-Y (n = 6) and CS-S (n = 1), and one paper adapted an existing Portuguese version of an adult measure of compassion (Compassion Scale – Adolescence; Sousa et al., 2022). See Table S1 in the Supplementary Information for an overview.

Review of Identified Measures

See Table S2 for an overview of participant demographic information. Measure quality ratings are shown in Table 2. Tables 3, 4 and 5 provide an overview of the psychometric properties outlined in each paper. Each paper and measure are described in greater detail below. See Table S3 for quality ratings for the individual papers. Table S4 provides a brief overview of translation processes employed in nine papers, information also summarised under evaluation of ‘content validity’ of a measure, where applicable.

Table 2

Quality ratings for measures

Measure	Content validity	Factor structure	Internal consistency	Test–retest reliability	Construct validity	Floor/ ceiling effects	Interpret ability	Total
CEAS-A	0	2	2	2	2	0	1	9
SCS-Y	1	1	1	1	1	1	2	8
SCS	0	1	1	0	2	1	2	7
CS-C	1	2	2	0	2	0	0	7
CS-A	0	2	1	0	1	0	1	5
SCS-C	0	2	2	0	0	0	0	4
S-SCS-A	1	0	2	0	0	0	1	4
SCRS-C	1	0	1	0	0	0	0	2

Rating: 0 criterion not met/insufficient data to rate, 1 criterion partially met, 2 criterion fully met. CEAS-A Compassionate Engagement and Actions Scales – Adolescence, CS-A Compassion Scale – Adolescence, CS-C Compassion Scale – Child, SCS Self-Compassion Scale, SCS-C Self-Compassion Scale – Child, SCS-Y Self-Compassion Scale – Youth, SCRS-C Self-Compassionate Reactions Scale – Child, S-SCS-A Shortened Self-Compassion Scale – Adolescence

Table 3

Overview of psychometric properties (content validity, factor structure and internal consistency)

Measure	Authors	Content validity: -Factors captured* -Item generation (recipient/expert groups consulted?)	Proposed factor structure	Support for factor structure: type of analysis (factor structure found)	Internal consistency: -adequate sample size for factor analyses? -Cronbach’s alpha/omega (total scale and subscales)
CEAS-A	Cunha et al. (2023)	4 factors (R, F, T and A) Recipients = yes Experts = no (original items were) [Adapted from Portuguese version of adult CEAS]	Three scales Self-compassion: Two first order, and two second order factors under overarching construct Compassion for others: Two factors under overarching construct Compassion from others: Two factors under overarching construct	CFA Self- compassion: Two first order, and two second order factors under overarching construct Compassion for others: Two factors under overarching construct Compassion from others: Two factors under overarching construct	Yes (n = 674) Total scales: α = 0.84 to 0.94 Subscales: α = 0.70 to 0.92
SCS	Cunha et al. (2016)	4 factors (U, F, T and A) (assumed) Recipients = Yes Experts = No [Portuguese translation]	Six factors represented under a single overarching construct	CFA (Six factors represented under a single overarching construct)	Yes (n = 3165) Total scale: α = 0.88 Subscales: α = 0.70 to 0.79
	Stolow et al. (2016)	4 factors (U, F, T and A) (assumed) Recipients = Reference not provided for paper adapting SCS so unable to assess Experts = As above [Revised child-suitable version of SCS]	NR	PCA (Two factors)	Yes (n = 193) Positive subscale: α = 0.87 (total), α = 0.82 to 0.89 (grade subgroups) Negative subscale: α = 0.92 (total), α = 0.89 to 0.93 (grade subgroups)
	Kumlander et al (2018)	4 factors (U, F, T and A) Recipients = no Experts = no (items original paper were) [Finnish translation]	Six factors represented under a single overarching construct or two factors	CFA (Two factors)	Yes (sample 1: n = 1725, re-tested with sample 2: n = 1497) Subscales: ω = 0.87 and 0.91
	Muris et al. (2018)	4 factors (U, F, T and A) Recipients = no Experts = no (items original paper were)	Two factors	PCA (Two factors)	Yes (n = 130) Total scale α = 0.89 Subscales: α = 0.61 to 0.84 (α values for specific subscales not given)
	Gruber et al. (2023)	4 factors (U, F, T and A) Recipients = No (& unable to assess in Stolow’s original paper) Experts = yes [German translation of Stolow’s revised child-suitable version of SCS]	Two, three or six factors Two factors: SCS-POS (SK, CH, MI) SCS-NEG (SJ, I, OI) Three factors SK-SJ CH-I MI-OI	CFA Best fit: Six- factor solution	Yes (n = 255) Total scale: α = 0.89 Subscales: Self-judgment α = 0.78 Overidentification α = 0.71 Isolation α = 0.78 Self-kindness α = 0.74 Common humanity α = 0.72 Mindfulness: α = 0.48
SCS-Y	Neff et al. (2021)	4 factors (U, F, T and A) Recipients = yes Experts = yes [re-written adult items of SCS]	Six factors represented under a single overarching construct	CFA and ESEM (sample 1 and 2) ESEM (Six factors represented under a single overarching construct)	Yes (sample 1: n = 279, sample 2: n = 402) Total: α = 0.82 (sample 1) and α = 0.85 (sample 2) Subscales: α < 0.70 for 1/6 subscales in samples 1 (overidentification) and 2 (mindfulness)
	Karakasidou et al. (2021)	4 factors (U, F, T and A) Recipients = no (items in original paper were) Experts = no (items in original paper were) [Greek translation]	Six factors represented under a single overarching construct	CFA (Six factors)	Yes (n = 193) Total: α = 0.64 Subscales NR
	Deniz et al. (2022)	4 factors (U, F, T and A) Recipients = no (items in original paper were), Experts = no (items in original paper were) [Turkish translation]	Six factors represented under a single overarching construct	CFA (Six factors represented under a single overarching construct)	Yes (n = 450) Total: α = 0.79 Subscales: NR
	Nazari et al. (2022)	4 factors (U, F, T and A) Recipients = yes, Experts = yes [Persian translation]	Six factors represented under a single overarching construct	ESEM (Six factors represented under a single overarching construct)	Yes (n = 532) Total: α = 0.88 Subscales: α = 0.80 to 0.90
	Huang et al. (2022)	4 factors (U, F, T and A) Recipients = yes Experts = Yes [Chinese translation]	Evaluated nine models: unidimensional, two-factor correlated CFA and ESEM, six-factor correlated CFA and ESEM models, bifactor CFA and ESEM models, two-bifactor correlated CFA and ESEM models Two-factor: SCS-POS (compassionate self-responding) vs SCS-NEG (uncompassionate self-responding) Bifactor = Six specific factors, one global SC factor, mutually orthogonal Two-bifactor: six specific factors, 2 global factors (compassionate vs uncompassionate self-responding). Global factors allowed to correlate	CFA and ESEM Six-factor correlated CFA and ESEM & bifactor ESEM, good model fit Six-factor correlated CFA solution better item factor loadings but inter-factor correlations of six-factor correlated and bifactor ESEMsuperior	Yes (n = 1298) SCS total: α = 0.85 SCS subscales: Self-kindness: α = 0.73 Self-judgment: α = 0.64 Common humanity: α = 0.69 Isolation: α = 0.78 Mindfulness: α = 0.73 Overidentification: α = 0.58
	Cheung et al. (2023)	4 factors (U, F, T and A) Recipients = No (items in original paper were) Experts = Yes [Chinese translation]	One, two or six factors	CFA Six-factor model best fit (2-factor model acceptable fit)	Yes (n = 2881, China = 1777, UK = 1104) The two countries exhibited an acceptable level of Cronbach’s alphas: SCS-Y total: China α = 0.76 UK α = 0.88 Subscales: Self-kindness: α = 0.79 Self-judgment: α = 0.79 Common humanity: α = 0.76 Isolation: α = 0.79 Mindfulness: α = 0.74 Overidentification: α = 0.73
	Henje et al. (2024)	4 factors (U, F, T and A) Recipients = Yes Experts = No (unclear) (items in original paper were) [Swedish translation]	i) bifactor model (one global factor and six specific factors. No cross-loadings allowed between G and specific factors, specific factors were allowed to correlate ii) two-bifactor model. No cross-loading allowed between G and specific factors. The two global factors (compassionate and uncompassionate self-responding) allowed to correlate with one another. Specific factors allowed to correlate	CFA Bifactor model and two-bifactor model excellent fit Comparative Chi-square test between models showed no significant difference	Yes (n = 316) SCS-Y total: α = 0.88 Self-kindness: α = 0.70 Self-judgement: α = 0.80 Common humanity: α = 0.87 Isolation: α = 0.78 Mindfulness: α = 0.69 Overidentification: α = 0.73 Girls: SCS-Y total: α = 0.89 Self-kindness: α = 0.72 self-judgement: α = 0.77 Common humanity: α = 0.86 Isolation: α = 0.75 Mindfulness: α = 0.72 Overidentification: α = 0.70 Boys: SCS-Y Total: α = 0.86 Self-kindness: α = 0.68 Self-judgement: α = 0.83 Common humanity: α = 0.88 Isolation: α = 0.82 Mindfulness: α = 0.62 Overidentification: α = 0.75
CS-C	Nas and Sak (2021)	3 factors (U, F and A) Recipients = yes, Experts = yes [Turkish original]	NR	EFA followed by CFA (Three factors)	Yes (n = 756) Total: α = 0.89 Subscales: α = 0.75 to 0.86
	Heidary et al. (2022)	3 factors (U, F and A) Recipients = yes, Experts = yes [Persian translation]	Three factors	CFA (Three factors)	Yes (n = 302) Subscales: α = 0.70 to 0.87
CS-A	Sousa et al. (2022)	4 factors (R, U, F and A) Recipients = no, Experts = yes (items in original paper also were) [adapted from existing Portuguese version (Sousa et al., 2022) of adult CS (Pommier et al., 2020]	Four factors represented under a single overarching construct	(bifactor) CFA and ESEM Hierarchical CFA model (Four factors represented under a single overarching construct) was best fit	Yes (sample 1: n = 658, sample 2: n = 183) Total: α = 0.86 (sample 1), α = 0.88 (sample 2) Subscales: α = 0.63 to 0.80 (sample 1), α = 0.71 to 0.80 (sample 2)
SCS-C	Sutton et al. (2018)	4 factors (U, F, T and A) Recipients = no, Experts = no (items original paper were) [adapted from SCS-SF (Raes et al., 2011)]	Two factors	CFA (Two factors)	Yes (n = 382) Subscales: α = 0.81 to 0.83
S-SCS-A	Muris et al. (2016)	4 factors (U, F, T and A) Recipients = yes Experts = yes [adapted from SCS – positive scales only]	Three factors represented under a single overarching construct	PCA (Three factors)	Yes (n = 132) Total scale: α = 0.84 Subscales: α = 0.74 to 0.79
SCRS-C	Zhou et al. (2019)	Unable to access scale Recipients = yes Experts = yes [Chinese original]	NR	CFA (Six factors under two overarching constructs)	Yes (n = 637) Total scale: α = 0.81 Subscales: α = 0.43 to 0.73 (first order), α = 0.78 to 0.81 (second order) Subscales: α < 0.70 for 5/6 first order subscales

^ns non-significant, NR Not reported, CEAS-A Compassionate Engagement and Actions Scales – Adolescence, CS-A Compassion Scale – Adolescence, CS-C Compassion Scale – Child, SCS Self-Compassion Scale, SCS-C Self-Compassion Scale – Child, SCS-Y Self-Compassion Scale – Youth, SCRS-C Self-Compassionate Reactions Scale – Child, S-SCS-A Shortened Self-Compassion Scale – Adolescence, R recognising suffering, U understanding the universality of suffering, F feeling for the person suffering, T tolerating uncomfortable feelings, A acting or being motivated to act to alleviate suffering (elements of compassion captured from the definition by Strauss et al., 2016)

Table 4

Overview of psychometric properties (test–retest reliability, floor/ceiling effects and interpretability)

Measure	Authors	Test–retest reliability: r or ICC (time between testing)	Floor and ceiling effects	Interpretability: - subgroups tested for differences -mean and standard deviation provided for number of subgroups
CEAS-A	Cunha et al. (2023)	Total scales: ICC = 0.97 to 0.98 (1 month)	NR	Gender: CEAS-A self- compassion scale. Female participants had lower levels of compassion (actions factor (female participants: M = 24.37, SD = 7.87, male participants: M = 25.67, SD = 7.33) CEAS-A compassion for others scale Female participants had higher levels of compassion (engagement and actions factors) CEAS-A compassion for others total: Female participants: M = 59.39, SD = 14.76, male participants: M = 55.73, SD = 12.15 Engagement factor: female participants: M = 32.69, SD = 8.71, male participants: M = 30.91, SD = 7.25; Action factor: female participants: M = 26.69, SD = 7.07, male participants: M = 24.82, SD = 6.24) M and SD: 2 subgroups
SCS	Cunha et al. (2016)	NR	No floor or ceiling effects observable from descriptive statistics and percentiles	Gender: Male participants had higher levels of self-compassion (M = 3.15, SD = 0.51) than female participants (M = 2.95 SD = 0.59). Male participants also had higher levels of self-kindness (male participants: M = 2.89, SD = 0.77, female participants: M = 2.83, SD = 0.77) and mindfulness (male participants: M = 3.12, SD = 0.77, female participants: M = 2.98, SD = 0.76) than female participants, and female participants had higher levels of self-judgement (male participants: M = 2.70, SD = 0.81, female participants: M = 3.03, SD = 0.83), isolation (male participants: M = 2.76, SD = 0.92, female participants: M = 3.10, SD = 0.93), and over- identification (male participants: M = 2.63, SD = 0.88, female participants: M = 3.05, SD = 0.89) than males M and SD: 2 subgroups
	Stolow et al. (2016)	NR	NR	Gender: Girls scored higher on the SCS negative subscale (M = 7.29, SD = 2.69) than boys (M = 6.25, SD = 2.35). No gender differences for SCS positive Age: Older participants scored higher on the SCS negative subscale (Grade 5: M = 12.55, SD = 2.11; Grade 8: M = 10.95, SD = 2.43; Grade 11: M = 10.22, SD = 2.71) M and SD: 5 subgroups
	Kumlander et al. (2018)	NR	NR	NR
	Muris et al. (2018)	NR	NR	NR
	Gruber et al. (2023)	NR	Data were not normally distributed, Kolmogorov- Smirnov and Shapiro–Wilk tests were significant (p ≤ 0.001)	Gender: Males (n = 104, M = 3.2, SD = 0.5) scored higher on the SCS total than females (M = 3.0, SD = 0.7) Females scored higher on SCS overidentification (M = 2.6, SD = 0.9) than males (M = 3.2, SD = 0.9) [items reverse coded] Females scored higher on SCS- isolation (M = 3.2, SD = 1.0) than males (M = 3.6, SD = 0.9) [items reverse coded] M and SD: 2 subgroups
SCS-Y	Neff et al. (2021)	Total: r = 0.83 Subscales: r =0.51 to 0.71 (3 weeks)	NR	Age: No difference (Study 1, 2 and 4) Grade: Trend for self-compassion to decrease in higher grades but not significant (Study 1 and 2). No difference (Study 4) Gender: No difference (Study 1). Males had higher levels of self-compassion (Study 2 and 4) but not significant (study 2). (Study 4: males: M = 3.18, SD = 0.54; females M = 2.99, SD = 0.68) Gender interaction: No interaction between gender and age, or gender and grade (Study 1, 2 and 4). However, trend for males to score slightly higher than females, and for females to slightly decrease in self- compassion with age (study 2) M and SD: 6 subgroups (for each study)
	Karakasidou et al. (2021)	NR	No floor or ceiling effects observable from normality data and percentile ranks	NR
	Deniz et al. (2022)	NR	NR	NR
	Nazari et al. (2022)	Total: ICC = 0.60 (4 weeks)	No floor or ceiling effects observable from normality data	Gender: Males had higher levels of self- compassion M and SD: not reported
	Huang et al. (2022)	NR	No floor or ceiling effects observable from normality data	NR
	Cheung et al. (2023)	NR	No floor or ceiling effects observable from descriptive statistics	Country (China vs UK): Chinese participants reported significantly higher on the six self-compassion factors than UK participants Chinese participants had higher total SCS-Y (M = 52.39, SD = 8.92) than UK participants (M = 45.83, SD = 10.96) UK participants tended to score higher on Item 15: Chinese participants tended to score higher on Item 17 Gender: Self-compassion factors are significantly higher in male participants than female participants M and SD: 2 subgroups (country) – not provided for gender
	Henje et al. (2024)	Three-week test–retest reliability ICC: SCS-Y Total: (0.86; 95% CI 0.81, 0.89; F = 12.81; p ≤.001) ICC subscales: self-kindness (0.74; 95% CI.66,.80; F = 6.64; p ≤.001), self-judgment (0.68; 95% CI 0.58, 0.86; F = 5.25; p ≤.001), common humanity (0.73; 95% CI 0.65, 0.79; F = 6.50; p ≤.001), isolation 0(.70; 95% CI 0.60, 0.77; F = 5.56; p ≤.001), mindfulness (0.74; 95% CI 0.65, 0.79; F = 6.55; p ≤.001) and overidentified (0.66; 95% CI 0.56, 0.74; F = 4.93; p ≤.001). The test’s stability over time was moderate to good	No floor or ceiling effects observable from descriptive statistics	The factor structure of SCS-Y-SE was equivalent for boys and girls Age: the factor structure of SCS-Y-SE was equivalent for the two age groups Gender: SCS Total: girls scored lower on the total self-compassion scale (M = 2.92, SD = 0.68) than boys (M = 3.19, SD = 0.67) SCS subscales: Girls scored higher than boys on self-judgement (girls: M = 2.96, SD = 0.97; boys: M = 3.43, SD = 1.08), isolation (girls: M = 3.25, SD = 0.90; boys: M = 3.67, SD = 1.03), over-identification (girls: M = 2.78, SD = 0.96; boys: M = 3.35 SD = 1.08) [all reverse coded] Age differences: SCS total: not significant Socioeconomic status: not significant M and SD: 4 subgroups
CS-C	Nas and Sak (2021)	NR	NR	NR
	Heidary et al. (2022)	NR	NR	NR
CS-A	Sousa et al. (2022)	NR	NR	Gender: Female participants in community sample had higher levels of compassion (M = 4.28, SD = 0.47) when compared to male participants (M = 3.93, SD = 0.56). Female participants scored higher on all subscales, except indifference, where male participants had higher levels Kindness: males: M = 3.74, SD = 0.82; females: M = 4.23, SD = 0.69 Common humanity: males: M = 4.18, SD = 0.71; females: M = 4.40, SD = 0.63 Mindfulness: males: M = 3.94, SD = 0.63; females: M = 4.27, SD = 0.55 Indifference: males: M = 2.17, SD = 0.76; females: M = 1.76, SD = 0.69 Community vs forensic sample: Male participants in the forensic sample had lower levels of compassion towards others (M = 3.77, SD = 0.67) than male participants in the community sample (M = 3.93, SD = 0.56). They also had lower levels of mindfulness (M = 3.67, SD = 0.82 vs community sample male participants: M = 3.94, SD = 0.63) Number of diagnoses: Indifference subscale associated with number of diagnoses M and SD: 3 subgroups
SCS-C	Sutton et al. (2018)	NR	NR	NR
S-SCS-A	Muris et al. (2016)	NR	NR	Gender: No differences for total sample. Female participants (M = 27.97, SD = 6.84) had lower levels of self- compassion than male participants (M = 31.54, SD = 6.11), and for self-kindness subscale (males: M = 10.29, SD = 2.80; females: M = 8.89, SD = 2.76) in older adolescent sample (15–17 years) Age: No difference M and SD: 2 subgroups
SCRS-C	Zhou et al. (2019)	Total: r = 0.46 Subscales: r = 0.58 to 0.61 (6 weeks)	NR	NR

Table 5

Overview of psychometric properties (construct validity )

Measure	Authors	Construct validity: correlation (Pearson’s r) of compassion measure with related constructs
CEAS-A	Cunha et al. (2023)	CEAS-A self-compassion (SC) total FSCSRS: r = 0.42 (self-reassurance), r = − 0.17 (self-criticism); SCS: r = 0.60 (positive), r = − 0.20 (negative); SLSS: r = 0.36 CEAS-A compassion for others (CFO) total FSCSRS: r = 0.40 (self-reassurance), r = 0.05^ns (self-criticism); SCS: r =.39 (positive), r = 0.05^ns (negative); SLSS: r = 0.24 CEAS-A compassion from others (CfO) total FSCSRS: r = 0.41 (self-reassurance), r = − 0.11 (self-criticism); SCS: r = 0.59 (positive), r = − 0.07 (negative); SLSS: r = 0.36
SCS	Cunha et al. (2016)	SCS EMWSS: r = 0.41; DASS- 21: r =—0.50 (depression), r = − 0.38 (anxiety), r = − 0.48 (stress) SCS self-kindness EMWSS: r = 0.34; DASS- 21:—0.21(depression), r = − 0.11 (anxiety), r = − 0.16 (stress) SCS self-judgement EMWSS: r = − 0.23; DASS- 21: r = 0.44 (depression), r = 0.39 (anxiety), r = 0.47 (stress) SCS common humanity EMWSS: r = 0.25; DASS- 21: − 0.08 (depression), N/A^ns (anxiety), − 0.05 (stress) SCS isolation EMWSS: r = − 0.31; DASS- 21: r = 0.52 (depression), r = 0.39 (anxiety), r = 0.48 (stress) SCS mindfulness EMWSS: r = 0.29; DASS- 21: r =—0.23 (depression), r = − 0.16 (anxiety), r = − 0.20 (stress) SCS overidentification EMWSS: − 0.23; DASS- 21: r = 0.48 (depression), r = 0.42 (anxiety), r = 0.52 (stress)
	Stolow et al. (2016)	SCS positive CDI (depression): r = − 0.15; CDEQ-SC (self-criticism): r =—0.11^ns; SEQ (self-esteem): r = 0.23 SCS negative CDI: r = 0.58; CDEQ-SC: r = 0.67; SEQ: r = − 0.62
	Kumlander et al. (2018)	Self-compassion—RBDI: r = − 0.40 Self-coldness—RBDI: r = 0.60 Self-judgment—RBDI: r = 0.56 Isolation—RBDI: r = 0.64 Over-identification—RBDI: r = 0.53 Self-kindness—RBDI: r = − 0.47 Common humanity—RBDI: r = − 0.29 Mindfulness—RBDI: r = − 0.29
	Muris et al. (2018)	SCS self-kindness STAIC: r = − 0.38; CDI: r = − 0.53; UCL-A: r = 0.50 (active tackling), r = 0.07^ns (palliative reacting), r = − 0.07^ns (avoidance), r = 0.42 (social support seeking), r = − 0.47 (passive reacting), r = − 0.29 (expression of emotion), r = 0.41(reassuring thoughts) SCS self-judgement STAIC: r = 0.61; CDI: r = 0.58; UCL-A A: r = − 0.25 (active tackling), r = 0.18 (palliative reacting), r = 0.19 (avoidance), r = − 0.25 (social support seeking), r = 0.56 (passive reacting), r = 0.25 (expression of emotion), r = − 0.11 (reassuring thoughts) SCS common humanity STAIC: r = − 0.19; CDI: r = − 0.32; UCL-A: r = 0.37 (active tackling), r = 0.13^ns (palliative reacting), r = 0.00^ns (avoidance), r = 0.22 (social support seeking), r = − 0.33 (passive reacting), r = − 0.23 (expression of emotion), r = 0.47 (reassuring thoughts) SCS isolation STAIC: r = 0.58; CDI: r = 0.53; UCL- A: r = − 0.25 (active tackling), r =—0.02^ns (palliative reacting), r = 0.22 (avoidance), r = − 0.28 (social support seeking), r = 0.53 (passive reacting), r = 0.17^ns (expression of emotion), r = − 0.09^ns (reassuring thoughts) SCS mindfulness STAIC: r = − 0.25; CDI: r = − 0.38; UCL-A: r = 0.48 (active tackling), r = 0.22 (palliative reacting), r = − 0.09^ns (avoidance), r = 0.30 (social support seeking), r = − 0.36 (passive reacting), r = − 0.21 (expression of emotion), r = 0.35 (reassuring thoughts) SCS overidentification STAIC: r = 0.58; CDI: r = 0.49; UCL-A: r = − 0.25 (active tackling), r =—0.06^ns (palliative reacting), r = 0.09^ns (avoidance), r = − 0.08^ns (social support seeking), r = 0.56 (passive reacting), r = 0.26 (expression of emotion), r = − 0.19 (reassuring thoughts)
	Gruber et al. (2023)	Tested ‘various correlations between SCS-CA, measures of mindfulness, quality of life, and externalizing and internalizing symptoms’ SCS-CA total: CAMM: r = 0.64 YSR-SF: r = − 0.68 YRS-SF-IA (internalization-anxiety): r = − 0.55 YSR-SF-ID (internalization-depression): r = 0.66 YSR-SF-ED (externalization-destructiveness): r = − 0.28 YSS-SF-EE (externalization-exhibitionism): r = − 0.38 ILK (quality of life): r = 0.67 SCS self-kindness: CAMM (mindfulness): r = 0.45 YSR-SF: r = − 0.50 YRS-SF-IA: r = − 0.39 YSR-SF-ID: r = − 0.50 YSR-SF-ED: r = − 0.22 YSR-SF-EE: r = − 0.29 ILK: r = 0.53 SCS self-judgement: CAMM: r = 0. 65 YSR-SF: r = − 0.67 YRS-SF-IA: r = − 0.58 YSR-SF-ID: r = − 0.63 YSR-SF-ED: r = − 0.26 YSR-SF-EE r = − 0.38 ILK: r = 0.57 SCS common humanity: CAMM (mindfulness): r = 0.12^ns YSR-SF: r = − 0.26 YRS-SF-IA: r = − 0.17 YSR-SF-ID: r = − 0.25 YSR-SF-ED: r = − 0.14 YSR-SF-EE: r = − 0.19 ILK: r = 0.26 SCS isolation: CAMM: r = 0.64 YSR-SF: r = − 0.62 YRS-SF-IA: r = − 0.52 YSR-SF-ID: r = − 0.67 YSR-SF-ED: r = − 0.20 YSR-SF-EE: r = − 0.24 ILK: r = 0.65 SCS mindfulness: CAMM: r = 0.24 YSR-SF: r = − 0.26 YRS-SF-IA: r = − 0.19 YSR-SF-ID: r = − 0.21 YSR-SF-ED: r = − 0.20 YSR-SF-EE: r = − 0.19 ILK: r = 0.31 SCS overidentification: CAMM: r = 0.55 YSR-SF: r = − 0.54 YRS-SF-IA: r = − 0.44 YSR-SF-ID: r = − 0.53 YSR-SF-ED: r = − 0.19 YSR-SF-EE: r = − 0.32 ILK: r = 0.49
SCS-Y	Neff et al. (2021)	Specified hypotheses SCS-Y total: CAMM: r = 0.47; CES-DC: r = − 0.53; SHS: r = 0.60; SWLS-C: r = 0.49; BRS: r = 0.65; PALS: r = 0.37 (mastery approach); − 0.18 (performance approach); r = − 0.38 (performance avoidance) SCS-Y self-kindness CAMM: r = 0.34; CES-DC: r = − 0.40; SHS: r = 0.60; SWLS-C: r = 0.50; BRS: r = 0.52; PALS: r = 0.44 (mastery approach); − 0.03 ns (performance approach); r = − 0.19 (performance avoidance) SCS-Y self-judgment CAMM: r = 0.53; CES-DC: r = − 0.52; SHS: r = 0.46; SWLS-C: r = 0.36; BRS: r = 0.60; PALS: r = 0.19 (mastery approach); − 0.34 (performance approach); r = − 0.40 (performance avoidance) SCS-Y common humanity CAMM: r = 0.11 ns; CES-DC (depression): r = − 0.09 ns; SHS (happiness): r = 0.30; SWLS-C (life satisfaction): r = 0.29; BRS (resilience): r = 0.25; PALS: r =.35 (mastery approach); 0.06 ns (performance approach); r = − 0.10 ns (performance avoidance) SCS-Y isolation CAMM: r = 0.46; CES-DC: r = − 0.50; SHS: r = 0.50; SWLS-C: r = 0.41; BRS: r = 0.50; PALS: r = 0.26 (mastery approach); − 0.20 (performance approach); r = − 0.38 (performance avoidance) SCS-Y mindfulness CAMM: r = 0.07^ns; CES-DC: r = -.12^ns; SHS: r = 0.35; SWLS-C: r = 0.31; BRS: r = 0.26; PALS: r = 0.35 (mastery approach); − 0.13^ns (performance approach); r = − 0.12^ns (performance avoidance) SCS-Y overidentification CAMM: r = 0.36; CES-DC: r = − 0.45; SHS: r = 0.27; SWLS-C: r = 0.16; BRS: r = 0.47; PALS: r = -.00^ns (mastery approach); − 0.30 (performance approach); r = − 0.33 (performance avoidance)
	Karakasidou et al. (2021)	SCS-Y total SWLS: r = 0.30; BRS: r = 0.42; SPANE: r = − 0.35 (negative), r = 0.42 (positive); STAIC: r = − 0.19 (state), r = − 0.44 (trait); SHS: r = 0.36; DASS: r = − 0.27 (depression), r = − 0.23 (stress) SCS-Y self-kindness SWLS: r = 0.35; BRS: r = 0.27; SPANE: r = − 0.29 (negative), r = 0.38 (positive); STAIC: r = − 0.05^ns (state), r = − 0.25 (trait); SHS: r = 0.32; DASS: r = − 0.11^ns (depression), r =—0.08^ns (stress) SCS-Y self-judgment SWLS: r = − 0.16; BRS: r = − 0.23 SPANE: r = 0.22 (negative), r = − 0.21 (positive); STAIC: r = 0.18 (state), r = 0.30 (trait); SHS: r = − 0.15; DASS: r = 0.19 (depression), r = 0.21 (stress) SCS-Y common humanity SWLS: r = 0.01^ns; BRS: r = − 0.10 ns; SPANE: r = 0.04^ns (negative), r =.12 ns (positive); STAIC: r = 0.21 (state), r = 0.09 ns (trait); SHS: r = 0.06 ns; DASS: r = 0.12^ns (depression), r = 0.15 (stress) SCS-Y isolation SWLS: r = − 0.14 ns; BRS: r = − 0.31; SPANE: r = 0.25 (negative), r = - 0.11^ns (positive); STAIC: r = 0.25 (state), r = 0.36 (trait); SHS: r = − 0.17; DASS: r = 0.30 (depression), r = 0.25 (stress) SCS-Y mindfulness SWLS: r = 0.27; BRS: r = 0.24; SPANE: r = − 0.19 (negative), r = 0.39 (positive); STAIC: r = − 0.12 ns (state), r = − 0.24 (trait); SHS: r = 0.31; DASS: r = − 0.13 ns (depression), r =—0.13^ns (stress) SCS-Y overidentification SWLS: r = − 0.04^ns; BRS: r = − 0.36; SPANE: r = 0.19 (negative), r = - 0.14^ns (positive); STAIC: r = 0.21 (state), r = 0.32 (trait); SHS: r = - 0.12^ns; DASS: r = 0.24 (depression), r = 0.22 (stress)
	Deniz et al. (2022)	Reported to be positively related to resilience and well-being, and negatively related to depression but examined by network analysis, and r values not given
	Nazari et al. (2022)	SCS-Y total Examined with SEM analysis and r values not reported SCS-Y self-kindness PHQ- 2: r = − 0.37; BFI- 10: r = − 0.32; BRS: r = 0.42; YLOT: r = 0.30 SCS-Y self-judgement PHQ- 2: r = 0.32; BFI- 10: r = 0.44; BRS: r = − 0.39; YLOT: r = − 0.24 SCS-Y common humanity PHQ- 2: r = − 0.22; BFI- 10: r = − 0.14; BRS: r = 0.24; YLOT: r = 0.22 SCS-Y isolation PHQ- 2: r = 0.28; BFI- 10: r = 0.25; BRS: r = − 0.32; YLOT: r = − 0.29 SCS-Y mindfulness PHQ- 2: r = − 0.34; BFI- 10: r = − 0.39; BRS: r = 0.38; YLOT: r = 0.27 SCS-Y overidentification PHQ- 2: r = 0.22; BFI- 10: r = 0.16; BRS: r = − 0.23; YLOT: r = − 0.18
	Huang et al. (2022)	Concurrent validity was evaluated by Pearson’s correlation coefficients between the SCS-Y and other criteria variables SCS-Y total: GAD- 7: r = − 0.60 PHQ- 9: r = − 0.58 CD-RISC- 10: r = 0.67 WHO- 5: r = 0.57 ICQ- 15: r = 0.47
	Cheung et al. (2023)	Discriminant validity: Item discrimination values for all SCS-Y items ranged between 1.70 and 2.74, suggesting a very high discrimination ability for the items. All subscales reported an average variance extracted (AVE) higher than 0.50, corresponding to an acceptable discriminant validity SCS-Y total: SCS total China: (Spearman) r = 0.75 SCS total UK: (Spearman) r = 0.84
	Henje et al. (2024)	SCS-Y-SE total: SCS: r = 0.72 WHO- 5: r = 0.48 BYI (Depression): r = − 0.59 BYI (Anger) r = : − 0.49 RCADS- R (anxiety): r = − 0.56 SDQ Functioning: r = − 0.29 PROMIS: r = − 0.21 SCS-Y Self-kindness SCS 0: r = 0.43 WHO- 5 0: r = 0.32 BYI (Depression): r = − 0.40 BYI (Anger): r = − 0.34 RCADS- R (anxiety): r = − 0.36 SDQ (strengths and difficulties): r = − 0.29 PROMIS: r = − 0.21 SCS-Y Self-Judgement: SCS: r = 0.78 WHO- 5: r = 0.50 BYI (Depression): r = − 0.61 BYI (Anger): r = − 0.49 RCADS- R (anxiety): r = − 0.61 SDQ: r = − 0.23 PROMIS: r = − 0.19 SCS-Y Common Humanity: SCS: r = 0.21 WHO- 5: r = 0.18 BYI (Depression); r = − 0.18 BYI (Anger): r = − 0.17 RCADS- R (anxiety): r = − 0.16 SDQ: r = − 0.17 PROMIS: r = 0.00^ns SCS-Y Isolation: SCS: r = 0.74 WHO- 5: r = 0.38 BYI (Depression): r = − 0.55 BYI (Anger): r = − 0.38 RCADS- R (anxiety): r = − 0.51 SDQ: r = − 0.15 PROMIS: r = − 0.18 SCS-Y Mindfulness: SCS: r = 0.25 WHO- 5: r = 0.25 BYI (Depression): r = − 0.28 BYI (Anger): r = − 0.32 RCADS- R (anxiety): r = − 0.26 SDQ: r = − 0.24 PROMIS: r = − 0.20 SCS-Y Overidentification: SCS: r = 0.68 WHO- 5: r = 0.45 BYI (Depression): r = − 0.53 BYI (Anger): r = − 0.40 RCADS- R (anxiety): r = − 0.51 SDQ: r = − 0.14^ns PROMIS (Pediatric Pain Interference) v2:− 0.13
CS-C	Nas and Sak (2021)	NR
	Heidary et al. (2022)	ERQ-CA: r = 0.44; CAMM: r = 0.51; RSPWB: r = 0.61; STAI: r = 0.00 (linear); STAI: r = − 0.19 (non-linear)
CS-A	Sousa et al. (2022)	CS-A total OASB-A: r = − 0.22; FSCRS self- criticism subscales: r = − 0.05^ns (inadequate self), r = − 0.13 (hated self); FSCRS reassured subscale: r = 0.19; SCS-total: r = 0.10; SCS-subscales: r = 0.13 (self-kindness), r = 0.24 (common humanity), r = 0.15 (mindfulness), r = − 0.02^ns (self- judgement), r = − 0.04^ns (isolation), r = 0.01^ns (over identification) CS-A kindness OASB-A: r = − 0.14; FSCRS self- criticism subscales: r = − 0.02^ns (inadequate self), r = − 0.06^ns (hated self); FSCRS reassured subscale: r = 0.14; SCS-total: r = 0.03^ns SCS-subscales: r = 0.07^ns (self- kindness), r = 0.17 (common humanity), r = 0.10 (mindfulness), r = 0.03^ns (self-judgement), r = 0.04^ns (isolation), r = 0.05^ns (over identification) CS-A common humanity OASB-A: r = − 0.12; FSCRS self-criticism subscales: r = − 0.01^ns (inadequate self), r = − 0.10 (hated self); FSCRS reassured subscale: r = 0.25; SCS-total: r = 0.15 SCS-subscales: r = 0.17 (self- kindness), r = 0.30 (common humanity), r = 0.20 (mindfulness), r = − 0.03^ns (self-judgement), r = − 0.07^ns (isolation), r = − 0.02^ns (overidentification) CS-A mindfulness OASB-A: r = − 0.16; FSCRS self-criticism subscales: r = − 0.01 ns (inadequate self), r = − 0.10 (hated self); FSCRS reassured subscale: r = 0.15; SCS-total: r = 0.09 SCS-subscales: r = 0.12 (self- kindness), r = 0.21 (common humanity), r = 0.19 (mindfulness), r = 0.03^ns (self-judgement), r = − 0.02^ns (isolation), r = − 0.01^ns (over identification) CS-A indifference OASB-A: r = 0.21; FSCRS self-criticism subscales: r = 0.09 (inadequate self), r = 0.14 (hated self); FSCRS reassured subscale: r = − 0.10; SCS-total: r = − 0.04^ns SCS-subscales: r = − 0.04^ns (self- kindness), r = − 0.08 (common humanity), r = − 0.01^ns (mindfulness), r = − 0.03^ns (self-judgement), r = 0.07^ns (isolation), r = − 0.01^ns (over identification)
SCS-C	Sutton et al. (2018)	SCS-C positive MAAS-C: r = 0.16; SDQ: r = 0.50 (general), r = 0.39 (school); RI: r = 0.45; SWLS-C: r = 0.40; PANAS-C: r = 0.42 (positive), r = − 0.09^ns (negative); SPQ: r = − 0.13 (anxiety), r = − 0.22 (depression); IRP: r = 0.42 (empathetic concern), r = 0.54 (perspective taking); SGQ: r = 0.60 SCS-C negative MAAS-C: r = − 0.41; SDQ: r = − 0.10^ns (general), r = − 0.05^ns (school); RI: r = − 0.39; SWLS-C: r = − 0.25; PANAS- C: r = − 0.16 (positive), r = 0.46 (negative); SPQ: r = 0.51 (anxiety), r = 0.43(depression); IRP: r = 0.21 (empathetic concern), r = 0.08^ns (perspective taking); SGQ: r = 0.09^ns
S-SCS-A	Muris et al. (2016)	S-SCS-A total r = 0.44 (SPPC), r = 0.50 (SEQ-C), r = − 0.26 (SCARED), r = − 0.35 (CDI) S-SCS-A self-kindness r = 0.37 (SPPC), r = 0.32 (SEQ-C), r = − 0.12^ns (SCARED), r = − 0.28 (CDI) S-SCS-A common humanity r = 0.17^ns (SPPC), r = 0.28 (SEQ-C), r = − 0.19 (SCARED), r = − 0.12 (CDI) S-SCS-A mindfulness r = 0.54 (SPPC), r = 0.63 (SEQ-C), r = − 0.34 (SCARED), r = − 0.45 (CDI)
SCRS-C	Zhou et al. (2019)	SCRS-C total r = 0.47 (SAS), r = 0.28 (MAAS-C), r = 0.60 (RSES), r = 0.03 (DDDT) SCRS-C compassionate reaction r = 0.11^ns (SAS), r = 0.06^ns (MAAS- C), r = 0.28 (RSES), r = 0.06^ns (DDDT) SCRS-C uncompassionate reaction r = − 0.56 (SAS), r = − 0.34 (MAAS-C), r = − 0.58 (RSES), r = 0.01^ns (DDDT)

^ns non-significant, NR Not reported, CEAS-A Compassionate Engagement and Actions Scales – Adolescence, CS-A Compassion Scale – Adolescence, CS-C Compassion Scale – Childm SCS Self-Compassion Scale, SCS-C Self-Compassion Scale – Child, SCS-Y Self-Compassion Scale – Youth, SCRS-C Self-Compassionate Reactions Scale – Child, S-SCS-A Shortened Self-Compassion Scale – Adolescence, BFI- 10 Brief 10‑Item Big Five Inventory (Rammstedt & John, 2007), BRS Brief Resilience Scale (Smith et al., 2008), BYI Beck Youth Inventories (BYI) (Beck et al., 2001), CAMM Child and Adolescent Mindfulness Measure (Greco et al., 2011), CDEQ-SC Children’s Depressive Experiences Questionnaire – self-criticism subscale (Abela & Taxel, 2001), CDI Children’s Depression Inventory (Kovacs, 1981), CD-RISC- 10 The 10-item Connor-Davidson Resilience Scale (Campbell-Sills & Stein, 2007) CES-DC The Center for Epidemiological Studies Depression Scale for Children (Faulstich et al., 1986), DASS Depression Anxiety Stress Scales: (Stalikas & Flora, 2012), DASS- 21 Depression, Anxiety and Stress Scales (Lovibond & Lovibond, 1995, Portuguese version: Pais-Ribeiro et al., 2004), DDDT Dirty Dozen Dark Triad (Jonason & Webster, 2010), DERS‑SF Difficulties in Emotion Regulation Scale – Short Form (Kaufman et al., 2016), EMWSS Early Memories of Warmth and Safeness Scale (Richter et al., 2009, Portuguese version for adolescents: Cunha et al., 2014), ERQ-CA Emotion Regulation Questionnaire-Children and Adolescents (Gullone & Taffe, 2012), FSCRS The Forms of Self-Criticizing/Attacking & Self-Reassuring Scale (Castilho & Pinto-Gouveia, 2011), FSCSR The Forms of Self-Criticizing and Self-Reassuring Scale (Gilbert et al., 2004, Portuguese version: Silva & Salvador, 2010), GAD- 7 Generalized Anxiety Disorder Scale (Spitzer et al., 2006), ICQ− 15 15-item Interpersonal Competence Questionnaire (Coroiu et al., 2015), ILK Inventory for measuring quality of life in children (Mattejat & Remschmidt, 2006) IRP Interpersonal Reactivity Index (Davis, 1983, Oberle et al., 2010), MAAS-C Mindful Attention Awareness Scale – Children (Lawlor et al., 2014), OASB-A Other as Shamer Scale Brief – Adolescent version (Cunha et al., 2015), PALS Patterns of Adaptive Learning Scales (Midgley et al., 1998), PANAS-C Positive and Negative Affect Schedule for Children (Laurent et al., 1999), PHQ- 2 Patient Health Questionnaire (Kroenke et al., 2003), PHQ- 9 Patient Health Questionnaire- 9 (PHQ- 9) Kroenke et al., 2001), PROMIS Patient Reported Outcome Measurements Information System (PROMIS) (Cella et al., 2007), RBDI Revised Beck Depression Inventory (Finnish modification, Kaltiala-Heino et al., 1999), RCADS-R Revised Child Anxiety and Depression Scale (RCADS) RCADS long scale (Chorpita et al., 2005) RI Resiliency Inventory – optimism subscale (Song, 2003), RSES 10-item Rosenberg Self-Esteem Scale (Chinese version) (Rosenberg, 1965), RSPWB Ryff Scale Psychological Wellbeing (Khanjani et al., 2014), SAS Self-Acceptance Scale (Cong & Gao, 1999), SCARED Screen for Child Anxiety Related Emotional Disorders (Birmaher et al., 1997), SCS Self-Compassion Scale (Cunha et al., 2016), SDQ Self-Description Questionnaire (Marsh, 1988), SDQ Strength and Difficulties Questionnaire (Goodman, 2001) SEQ Self-Esteem Questionnaire (Rosenberg, 1965), SEQ-C Self-Efficacy Questionnaire for Children (Muris, 2001), SGQ Social Goals Questionnaire – prosocial goals subscale (Wentzel, 1993), SHS Subjective Happiness Scale (Lyubomirsky & Lepper, 1999), SLSS Students’ Life Satisfaction Scale (Marques et al., 2007), SPANE Scale of Positive and Negative Experience (Diener et al., 2010), SPPC Self-Perception Profile for Children (Harter, 1985), SPQ Seattle Personality Questionnaire for Young School-Aged Children – anxious and depressive symptoms subscales (Kusche et al., 1988), STAI State-Trait Anxiety Inventory, STAIC State-Trait Anxiety Inventory for Children (Spielberger, 1973), SWLS Life Satisfaction Scale (Stalikas & Lakioti, 2012), SWLS-C Satisfaction with Life Scale – Children (Gadermann, 2009), UCL-A The Utrecht Coping List for Adolescents (Bijstra et al., 1994), WHO- 5 5-item World Health Organization Well-being Index (Blom et al., 2012), YLOT Youth Life Orientation Test (Ey et al., 2005), YSR-SF Youth Self Report – short form (Cruz et al., 2014)

Compassionate Engagement and Action Scales–Adolescence (CEAS-A)

The CEAS is three self-report scales, assessing self-compassion (SC), compassion for others (CFO), and the ability to receive compassion from others (CfO), respectively. Each has two sections: (1) compassionate engagement (8 items); and (2) compassion action (5 items). The CEAS (and CEAS-A) use a 10-point response scale (ranging from never to always).

Population

The main sample was 674 school students in Portugal, aged 12–19 (M = 14.88, 61% female). Construct validity was assessed with a subsample of 336 participants (M = 15.48, 73% female). Test–retest reliability was assessed with 76 participants (M = 15.87, 89% female).

Content Validity

Cunha et al. (2023) adapted the adult version of the (Portuguese version of the) CEAS (Gilbert et al., 2017) to use with young people. Items were discussed with a group of adolescents (n = 18) and revised to make them more understandable and easier for adolescent use. However, experts were not consulted. Items related to 4/5 elements of compassion (not universality).

Factor Structure and Reliability

CFA supported the factor structure of the CEAS-A that was found for the adult version, including two distinct processes: 1) engagement with suffering, 2) an action component to alleviate or prevent suffering. The three-order factor model was a very good fit. The higher-order factor compassion for self, included two second-order factors: the engagement and actions subscales. The engagement subscale comprised the sensitivity to suffering and engagement with suffering dimensions. However, the authors noted that the model fit resulted from error term correlations between two pairs of items of the actions subscale, which they suggested might be related to similar phrasing in the Portuguese version of Items 4 and 5. For the CFO and CfO scales, a higher-order factor encompassed two first-order factors: the engagement and actions subscales. The model presented a good fit to the data, after removal of item 4 on the engagement subscale, for CFO. Cunha et al. (2023) found good–excellent internal consistency for the total scales (α = 0.84 to 0.94), acceptable-excellent internal consistency for subscales (α = 0.70 to 0.92) and excellent test–retest reliability for the three total scales (ICC = 0.97 to 0.98).

Construct Validity and Interpretability

Congruent with hypotheses, the CEAS-A scales were positively correlated with one another, and had positive associations with self-reassurance, life satisfaction, and the self-compassion scale positive factor, with small-large effects. In relation to discriminant validity, Cunha et al. (2023) reported that correlations between the CEAS-A scales and negative variables were weaker than positive variables. SC and CfO scales had negative associations with the self-compassion negative factor and self-criticism, with small effects. Ten correlations were r ≥ 0.50.

Means and standard deviations were reported for the total sample, males, and females for total and subscale scores. Subgroup analyses were conducted. Females scores higher than males on the CFO scale, and lower than males on the SC scale. Floor and ceiling effects were not examined.

Summary

The CEAS-A was rated 9/14 for quality. Whilst respondents were consulted, experts were not, and the’ understanding the universality of suffering’ element of compassion was not captured by items. Experts could be engaged to develop and refine items related to ‘universality of suffering’. CFA supported the proposed factor structure, and internal consistency and test–retest reliability were good. The error term correlations identified in the ‘actions’ subscale items should have been addressed; these items should have been revised to improve model fit without compromising conceptual integrity. There was evidence for construct validity and interpretability was facilitated. Floor and ceiling effects were not assessed.

Self-Compassion Scale (SCS)

The SCS was developed by Neff (2003b) to assess self-compassion in adults and is the most widely used measure of self-compassion (Neff et al., 2019). It consists of 26 items on a 5-point response scale (ranging from almost never to almost always) and has six subscales (self-kindness, mindfulness, common humanity, self-judgment, isolation, over- identification). Five papers examined the psychometric properties of the SCS (Cunha et al., 2016; Gruber et al., 2023; Kumlander et al., 2018; Muris et al., 2018; Stolow et al., 2016).

Population

The sample in Cunha et al (2016) included 3165 CYP aged 12–19 from Portugal (M = 15.49, 53.8% female) and in Stolow et al. (2016) included 193 CYP aged 9–16 from the US (M = 13.00, 59% female, 58% Caucasian). The sample in Kumlander et al. (2018) included 1725 adolescents from Finland, with 75% aged 15–17 (M = 16.56, 50.3% female). Muris et al. (2018) recruited 130 CYP aged 15–19 from the Netherlands (M = 16.68, 66% female). Gruber et al. (2023) evaluated the SCS with a German community sample of 10–19-year-olds (M = 14.9, SD = 2.6, 59.2% female). No papers reported SES, and only Stolow et al. (2016) reported ethnicity.

Content Validity

Cunha et al. (2016) used the Portuguese version (Pinto-Gouveia, 2011) of Neff’s (2003b) SCS, adapted for adolescents (Cunha et al., 2013). A group of adolescents were consulted on the items but independent experts were not. Stolow et al. (2016) used a revised child-suitable version of the SCS, stating original meaning was maintained whilst ensuring items could be understood by younger children. However, no reference was provided, and it was not possible to assess the items used. Gruber et al. (2023) translated the child-revised version of the SCS by Stolow et al. (2016) to German, using an independent translator for backward translation and consulting independent mindfulness and compassion experts. The SCS was translated from English to Finish by Kumlander et al. (2018), by the research group and an independent professional translator. A further independent translator did the backward translation; no independent bilingual experts were consulted. No adaptations were reported by Muris (2017). None of the papers consulted both experts and CYP . Items related to 4/5 elements of compassion (not recognising). Whilst it was not possible to assess the items used in two papers, the aforementioned was assumed.

Factor Structure and Reliability

Cunha et al. (2016) conducted CFA, which supported the use of an overall self-compassion score and six subscale scores, congruent with the adult SCS (Neff, 2003b) and Neff’s (2003a) conceptualisation of compassion (also see Neff et al., 2019).

Stolow et al. (2016) conducted Principal Component Analysis (PCA), yielding two factors. The SCS-positive factor comprised of items from the positive subscales and the SCS-negative factor of items from the negative subscales. Neither EFA nor CFA were conducted. Kumlander et al. (2018) conducted CFA, which supported the six-factor model and two-factor model. However, high correlations among the three positive and three negative factors for the six-factor model showed strong multicollinearity, questioning the extent to which the six factors measured separate constructs. Therefore, use of two factors (self-compassion and self-coldness) was supported, and use of six subscales or total self-compassion score was not recommended. Findings were replicated in a second sample. Muris et al. (2018) did not conduct CFA but conducted joint PCA on all questionnaire data (including convergent validity measures). Two factors emerged. The positive subscales loaded on a factor also composed of adaptive coping styles, and the negative subscales loaded on a factor with scales measuring anxiety and depression and maladaptive coping styles. Whilst positive subscales were considered representative of self-compassion, they concluded negative subscales should be removed as they are indicators of vulnerability and inflate the relationship between self-compassion and symptoms of mental health difficulties. Use of the total score including negative subscales was cautioned. Gruber et al. (2023), however, tested a two- (positive and negative scales, respectively), three- (the three bipolar components of self-compassion) and six-factor solution using CFA and the six-factor solution was the best fit.

Cunha et al. (2016) found good internal consistency for the total SCS (α = 0.88) and acceptable internal consistency for subscales (α = 0.70 to 0.79). Kumlander et al. (2018) found good internal consistency for the two factors (ω = 0.87, 0.91). Stolow et al. (2016) found good–excellent internal consistency for the total positive and negative subscales (α = 0.87, 0.92) for the total sample, and for each school grade (α = 0.82 to 0.93). Muris et al. (2018) found internal consistency was good for the total scale (α = 0.89) and questionable-good for subscales (α = 0.61 to 0.84). Gruber et al. (2023) reported acceptable internal consistency for subscales (α = 0.72–0.78), except for the mindfulness subscale (α = 0.48), and good internal consistency for the total scale (α = 0.89). No paper reported test–retest reliability.

Construct Validity and Interpretability

As hypothesized by Cunha et al. (2016), the SCS was positively associated with early memories of warmth and safeness, and negatively associated with depression, anxiety, and stress. These were medium effects, and one correlation was r ≥ 0.50. Hypotheses were not provided for subscales. Positive subscales were positively associated with early memories of warmth and safeness, and negatively associated with depression, anxiety (except common humanity), and stress. The inverse direction was found for negative subscales. Effects for subscales were small-medium. Stolow et al. (2016) did not specify hypotheses for construct validity. The negative subscale was positively associated with depression and self-criticism, and negatively associated with self-esteem. The positive subscale was negatively associated with depression and positively associated with self-esteem, and not associated with self-criticism. Effects were small-large, and three correlations were r ≥ 0.50. As hypothesized, Kumlander et al. (2018) found the self-compassion factor was negatively associated with depression, and the self-coldness factor was positively associated with depression. Positive and negative subscales were negatively and positively associated with depression, respectively. Effects were medium-large and four correlations were r ≥ 0.50.

Results from Muris et al. (2018) were largely as hypothesized. Positive subscales were positively associated with adaptive coping and negatively associated with anxiety and depression. Negative subscales were positively associated with anxiety and depression and less helpful coping strategies. Effects were small-large, and ten correlations were r ≥ 0.50. Gruber et al. (2023) did not offer hypotheses, but findings showed medium-large associations between self-compassion (SCS) and mindfulness, quality of life, internalizing and externalizing symptoms, except for the ‘common humanity’ and ‘mindfulness’ subscales, which showed a small-medium association with the other variables. Twenty correlations were r ≥ 0.50.

Cunha et al. (2016) provided means and standard deviations for total and subscale scores for the total, male and female samples. Subgroup analyses showed males had higher levels of self-compassion, self-kindness, and mindfulness than females. Females had higher levels of self-judgement, isolation, and over-identification than males. Descriptive statistics and percentiles indicated no floor and ceiling effects. Stolow et al. (2016) provided means and standard deviations for subscale scores for the total sample and by school grade and gender. Females and older participants scored higher on the negative subscale than males and younger participants, respectively. There were no gender differences for the positive subscale. Gruber et al. (2023) provided means and standard deviations for total and subscale scores for the total, male and female samples. They found that boys had significantly higher total SCS scores than women and girls reported significantly higher SCS overidentification and isolation scores than men. No other gender differences were found. Interpretability was not facilitated by Kumlander et al. (2018) or Muris et al. (2018). Cunha et al. (2016) provided data which indicated floor and ceiling effects were not present. Gruber et al. (2023) reported that data was not normally distributed (highest mean score for self-judgment subscale, lowest for the overidentification subscales) but normality tests alone cannot confirm floor or ceiling effects.

Summary

The SCS was rated 7/14 for quality. No papers consulted both experts and recipients and the ‘recognizing’ element of compassion was not captured. Evidence for the factor structure was mixed, and two papers did not conduct EFA/CFA. Internal consistency was questionable-excellent, and test–retest was not conducted. There was some evidence for construct validity, subgroup comparisons were conducted, and floor and ceiling effects were not explicitly assessed.

Given the mixed evidence for Neff’s proposed factor structure, it should be examined further across different cultural adaptations to establish a more consistent factor model, taking into consideration the recommendation of Neff et al. (2019) to use bifactor ESEM rather than CFA to capture the multidimensionality of the scale. The internal consistency of subscales with questionable reliability (e.g., subscales with α < 0.70) should be improved and/or implications for recommending use of subscale scores should be considered.

Self-Compassion Scale–Youth (SCS-Y)

Neff et al. (2021) developed and validated a youth version of the SCS. The SCS-Y assesses self-compassion and consists of 17 self-report items with a 5-point response scale (ranging from almost never to almost always). The SCS-Y has six subscales (self-kindness, self-judgment, common humanity, isolation, and over-identification). Six papers validated the SCS-Y (Karakasidou et al., 2021; Deniz et al., 2022; Nazari et al., 2022; Huang et al., 2022; Cheung et al., 2023; Henje et al., 2024).

Population

Neff et al. (2021) developed the SCS-Y with 279 adolescents aged 11–15 (M = 12.17, 56.7% female), cross-validated the factor structure in a second sample of 402 adolescents aged 11–15 (M = 12.43, 48.8% female), examined test re-test reliability in a subset of 102 adolescents aged 11–14 (M = 12.52, 51% female) and examined construct validity with 212 adolescents aged 11–14 (M = 12.18, 42.5% female). All were school samples in the US. Participants were reported to be from socioeconomically diverse schools and the majority identified as White. Karakasidou et al. (2021) validated the SCS-Y with 193 Greek CYP aged 8–14 (M = 11.69, 51.3% male, 100% White), reported to be from socioeconomically diverse schools. Deniz et al. (2022) validated the SCS-Y with 450 Turkish adolescents aged 11–15 (M = 13.09, 61.8% female) and Nazari et al. (2022) with 532 Persian adolescents aged 12–15 (M = 13.57, 50.8% female). Huang et al. (2022) validated the SCS with a Chinese student sample aged 10–17 (M = 12.90, SD = 1.84, 52.5% female) while Cheung et al. (2023) used a community sample of 2881 Chinese (age: M = 16.58, SD = 1.32, 56.8% female, 98% Asian) and 1104 UK (age: M = 16.42, SD = 1.30, 76.1% female, 70% White) 12–18-year-olds. Henje et al. (2024) validated the SCS-Y with Swedish students aged 15–20 (M = 17.07, SD = 1.26, 67% female, 90% born in Sweden), with the majority < 18 (n = 205).

Content Validity

Neff et al. (2021) developed potential items based on the authors’ expertise in cognitive development. A small number of early adolescents and middle school teachers were also consulted. Karakasidou et al. (2021) and Deniz et al. (2022) translated the SCS-Y to Greek and Turkish, respectively, but did not consult with independent bilingual experts or respondents. Nazari et al. (2022) reviewed translated items (Persian) with an expert and students from the target sample. Huang et al. (2022) and Cheung et al. (2023) translated the SCS-Y to Chinese. Both papers used independent back-translators and (a) bilingual expert(s). Huang et al. (2022) consulted 15 CYP to establish readability. Henje et al. (2024) translated the SCS-Y to Swedish; the process of forward translation was not clear; a professional translator conducted the back-translation. It was not reported whether experts were consulted. Although four adolescent girls were consulted, this falls short of the recommended minimum of 10 (Sousa & Rojjanasrirat, 2010). Items related to 4/5 elements of compassion (not recognizing).

Factor Structure and Reliability

Neff et al. (2021) conducted bifactor exploratory structural equation modelling (ESEM) which supported the use of an overall self-compassion score and six subscale scores and was cross-validated in a second sample. Support for this factor structure was provided by Nazari et al. (2022) (ESEM) and Deniz et al. (2022) (CFA). Whilst providing support for this structure, Deniz et al. (2022) also found support for other factor structures and found invariance in all models. Karakasidou et al. (2021) did not support the factor structure reported by Neff et al. (2021), finding this model a poor fit, and instead supported a six-factor structure. Their sample was considerably smaller (n = 193) than for the other studies, which could be a contributing factor. Huang et al. (2022) tested nine models (CFA and ESEM) and found support for a global self-compassion factor and a six-factor (correlated) structure. Cheung et al. (2023) tested a single-, two- and six-factor model (CFA) and found that the six-factor model was the best fit. Henje et al. (2024) tested (two-)bi-factor models (six factors and one global factor and two global factors and six specific factors) using CFA and both proved a good fit.

Neff et al. (2021) found good internal consistency for the total scale (α = 0.82, 0.85), and acceptable internal consistency for five of six subscales (α = 0.70 to 0.80) in both studies. Internal consistency for overidentification (α = 0.66) and mindfulness (α = 0.67) were questionable for study 1 and 2, respectively. Deniz et al. (2022) reported acceptable internal consistency for the total scale (α = 0.79), while Karakasidou et al. (2021) found questionable internal consistency (α = 0.64). Neither reported internal consistency for subscales. Nazari et al. (2022) demonstrated good–excellent internal consistency for the total SCS-Y (α = 0.88) and subscales (α = 0.80 to 0.90). Huang et al. (2022) reported questionable internal consistency for self-judgment (α = 0.64) and common humanity (α = 0.69) and poor internal consistency for overidentification (α = 0.58), which they attributed to possible cultural differences. The remaining subscales and total scale had acceptable-good internal consistency. Cheung et al. (2023) reported acceptable to good internal consistency for the total and sub-scales (across the UK and Chinese samples). Henje et al. (2024) reported acceptable to good internal consistency for the total and sub-scales, apart from mindfulness (α = 0.69). Internal consistency was acceptable-good for girls. For boys, internal consistency was questionable for mindfulness (α = 0.62) and self-kindness (α = 0.68) but acceptable to good for the total and other subscales.

Neff et al. (2021) found good test–retest reliability for the total score (r = 0.83). In relation to subscales, two were adequate (self-kindness, self-judgment, r = 0.70, 0.71), three were questionable (common humanity, isolation, mindfulness, r = 0.63 to 0.65), and one was poor (overidentification, r = 0.51). Nazari et al. (2022) found adequate test–retest reliability for the total scale after 4 weeks (ICC = 0.60). Henje et al. (2024) reported adequate-good test–retest reliability for the total and sub-scales, apart from self-judgment (r = 0.68) and overidentification (r = 0.66), which were questionable. Test–retest reliability was not examined by Karakasidou et al. (2021), Deniz et al. (2022), Huang et al. (2022), and Cheung et al. (2023).

Construct Validity and Interpretability

Neff et al. (2021) found associations in line with hypotheses. Self-compassion was positively associated with happiness, life satisfaction, mindfulness, resilience, and mastery-approach goals, and negatively associated with depression, performance-avoidance goals, and performance-approach goals. Effects were small-large, and three correlations were r ≥ 0.50. At the subscale level, notable differences were a lack of significant association between the common humanity and mindfulness subscale and mindfulness and depression. Neff et al. (2021) attributes the lack of association between the mindfulness subscale and CAMM (mindfulness measure) to the subscale capturing balanced equanimity in relation to suffering rather than paying attention to present-moment experience more generally. Associations with performance avoidance and approach goals were nonsignificant for the positive subscales (self-kindness, common humanity, mindfulness).

Karakasidou et al. (2021) did not outline specific hypotheses, instead making vague predictions that positive associations with similar constructs would be indicative of convergent validity, and negative or no correlation with unrelated constructs would indicate discriminant validity. They found positive associations with life satisfaction, resilience, positive emotions, and happiness, and negative associations with negative emotions, anxiety, depression, and stress, with small-medium effects. The three negative factors (self- judgement, isolation, overidentification) had positive associations with negative emotions, stress, depression, and anxiety, and negative associations with positive emotions, happiness, life satisfaction, and resilience, with small-medium effects. Two positive factors (self- kindness and mindfulness) had positive associations with positive emotions, happiness, life satisfaction, and resilience, and negative associations with negative emotions, stress, and depression, with small-medium effects. Unexpectedly, the common humanity factor correlated positively with stress, anxiety, and negative SCS-Y factors. No correlations were r ≥ 0.50.

Deniz et al. (2022) reported that the SCS-Y was positively related to resilience and wellbeing, and negatively related to depression, congruent with hypotheses. However, this was examined by network analysis and no correlation values were given.

As expected, Nazari et al. (2022) found positive associations between positive factors of self-compassion with resilience and optimism, and negative associations with depression and neuroticism. Negative self-compassion factors also had positive associations with depression and neuroticism, and negative associations with resilience and optimism. Effects were small-medium, and no correlations were r ≥ 0.50. The SCS-Y total had negative associations with depression and neuroticism, and positive associations with resilience and optimism. However, this was examined with SEM analysis and r-values were not reported.

Huang et al. (2022), Cheung et al. (2023) and Henje et al. (2024) did not outline specific hypotheses. Huang et al. (2022) found large negative associations with anxiety and depression. and medium-large positive associations with interpersonal competence, wellbeing, and resilience. Cheung et al. (2023) reported high discriminant ability for items (graded response model) and acceptable discriminant validity for subscales (average variance extraction). The SCS-Y correlated highly with the SCS in both Chinese and UK samples. Henje et al. (2024) reported medium-large negative associations of the SCS-Y total with anxiety and depression, medium negative association with anger, and medium-large positive associations with wellbeing. Associations with functioning, and paediatric pain interference supported divergent validity. The correlation with the SCS was high (0.72). Subscale associations were largely in line with those for the total scale, except for the common humanity and mindfulness’ subscales, which showed small-medium associations with wellbeing, in- and externalising symptoms. The positive SCS-Y subscales showed weaker associations with the total SCS than the (reverse-coded) negative subscales.

Neff et al. (2021) reported means and standard deviations for the total sample and male and female participants for three grades, for three studies. Subgroup analyses were conducted for age, grade, gender, and gender and grade/age interactions explored. No significant differences were found for age or grade, but there was a trend for self-compassion to decrease in higher grades. Males had greater levels of self-compassion in two of three studies (one non-significant). No interaction was found between gender and age, or gender and grade, although one study showed a trend for males to score slightly higher than females, and for females to slightly decrease in self-compassion with age. Nazari et al. (2022) did not report means or standard deviations but also found males had higher levels of self- compassion than females. Karakasidou et al. (2021), Deniz et al. (2022) and Huang et al. (2022) did not aid interpretability. Cheung et al. (2023) compared Chinese and UK participants and tested gender differences (but did not provide M and SD). Boys scored higher on self-compassion than girls on all subscales. Chinese participants reported higher self-compassion subscale scores than UK participants. Henje et al. (2024) reported measurement invariance for age and ambiguous measurement invariance for gender. They reported means and standard deviations for the total, male and female (sub)samples and age groups (15–17 vs 18–20). Girls scored lower on total self-compassion than boys and higher than boys on self-judgment, isolation and over-identification. Age and socio-economic status were not significantly associated with self-compassion.

Normality data (Huang et al., 2022; Karakasidou et al., 2021; Nazari et al., 2022), descriptive statistics (Cheung et al., 2023; Henje et al., 2024) and percentile ranks (Karakasidou et al., 2021) suggested that floor and ceiling effects were not evident.

Summary

The SCS-Y was rated 8/14 for quality. Experts and respondents were mostly consulted on items, but the recognizing element of compassion was not captured. Various models were tested, reflecting the controversy around Neff’s originally proposed factor structure in the adult literature (e.g. Muris & Otgaar, 2020; Neff et al., 2019). However, all studies, except for the study by Karakasidou et al. (2021), supported Neff’s proposed factor structure. Karakasidou et al. (2021) employed a notably smaller sample, reported questionable internal consistency and some unexpected correlations. This suggests methodological weaknesses and/or possible cultural and/or language factors warranting further investigation. Some studies reported that a two-bifactor model was also a good fit. Internal consistency and test–retest reliability ranged from questionable to good. There was some evidence for construct validity, although some papers did not outline hypotheses, report r values, or find multiple large correlations. Floor and ceiling effects did not appear evident and interpretability was facilitated.

Further reliability testing is needed to ensure consistency across diverse cultural samples. While almost all studies confirmed Neff’s proposed factor structure, some also provided evidence for a two-bifactor model, suggesting that further investigation is warranted. Neff (2020) recommend that bifactor ESEM should be used, given the measure’s multidimensionality.

Compassion Scale–Child (CS-C)

Nas and Sak (2021) developed the CS-C, a 20-item self-report measure of compassion with a 5-point response scale (ranging from never to always) that has three subscales: self-compassion (9 items) and compassion for others (5 items) and towards other living things (six items) that consists of 20 self-report items. One paper (Heidary et al., 2020) validated the CS-C.

Population

Nas and Sak’s (2021) sample included 756 Turkish CYP aged 12–18 (M = not reported, 52.4% female) with varied SES. Heidary et al. (2022) validated the CS-C with 302 Iranian CYP aged 12–18 (M = 15.85, 85% female). Neither provided information on ethnicity.

Content Validity

Initial items selected by Nas and Sak (2021) were based on a review of key literature, including the measurement of compassion. Teachers and experts were consulted, and adjustments made following feedback. The measure was then piloted in schools with students who were representative of the target age group. Minor adjustments were made accordingly. Heidary et al. (2022) also consulted students and experts in the field and retranslated one item. Items related to 3/5 elements of compassion (universality, feeling, acting). Around a third of items relate to compassion towards other living things (e.g., environment and animals) and this was the only measure to include this orientation of compassion.

Factor Structure and Reliability

EFA yielded a three-factor structure (compassion toward other people, compassion toward oneself, compassion toward other living things). CFA indicated the three-factor structure was an acceptable fit. CFA by Heidary et al. (2022) supported the three-factor structure. Nas and Sak (2021) reported good internal consistency for the total scale (α = 0.89), and acceptable-good internal consistency for subscales (α = 0.75 to 0.86). Heidary et al. (2022) did not report internal consistency for the total scale, but subscales were acceptable-good (α = 0.70 to 0.87). Neither paper assessed test–retest reliability.

Construct Validity and Interpretability

Construct validity was not examined by Nas and Sak (2021). Congruent with hypotheses, Heidary et al. (2022) found medium-large positive associations with emotion regulation, mindfulness, and wellbeing. Two correlations were r ≥ 0.50. In relation to discriminant validity, they predicted the CS-C would correlate negatively with anxiety and found a negative non-linear association. Neither paper facilitated interpretability nor examined floor and ceiling effects.

Summary

The CS-C was rated 7/14. Experts and the target group were consulted on items but recognizing and tolerating elements of compassion were not captured. Despite initial consultation during item development, the target population should be included in future validation studies to refine items further. The three-factor structure was deemed acceptable, and internal consistency was good-acceptable. There was evidence for construct validity, but interpretability was not facilitated, and test–retest reliability and floor and ceiling effects were not assessed.

Compassion Scale–Adolescence (CS-A)

The CS-A assesses compassion for others and consists of 16 self-report items with a 5-point response scale (ranging from almost never to almost always). The CS-A has four subscales (kindness, mindfulness, common humanity, indifference) with four items each.

Population

Sousa et al. (2022) validated the Portuguese version (Sousa & Rojjanasrirat, 2010) of the adult CS (Pommier et al., 2020) with two adolescent samples in Portugal. The community sample (n = 658) were students from public schools, aged 14–18 (37.8% male). The forensic sample (n = 183) were aged 14–18 (M = 15.96, 100% male), recruited from foster care homes and juvenile detention facilities, and 80.3% had a primary diagnosis of conduct disorder. Participants were from socioeconomically diverse backgrounds. Ethnicity was not reported.

Content Validity

The original CS was a 24-item scale developed by Pommier (2010) for adults, then reduced to its current 16-items (Pommier et al., 2020). Three items were slightly reworded from this to be clearly understood by adolescents, without altering content. This was conducted by the research team with extensive experience with adolescence and measure validation procedures. Translation and backtranslation by a bilingual researcher assured similarity between the adult and adolescent versions. The target population were not consulted. As the final items were not presented in Sousa et al. (2022), the scale by Pommier et al. (2020) was reviewed, taking into account reworded items. Items related to 4/5 elements of compassion (not tolerating).

Factor Structure and Reliability

CFA supported the proposed structure of the CS in the original study (Pommier et al., 2020), of a hierarchical higher-order factor of compassion, and four subscales (kindness, common humanity, mindfulness, indifference).

Internal consistency for the total scale was good for both samples (α = 0.86, 0.88) and acceptable-good for subscales (α = 0.63 to 0.80). Test–retest reliability was not examined.

Construct Validity and Interpretability

As hypothesized, the CS-A had negative associations with external shame and self-criticism (hated self), and positive associations with self-compassion. However, there was not a significant association with a second self- criticism subscale (inadequate self). There was also a positive association between the CS-A subscales of common humanity, and mindfulness, with these same factors on the SCS-A. All effects were small, with no correlations of r ≥ 0.50.

Means and standard deviations for total and subscale scores were provided for community males, community females, and the total forensic sample. Measurement invariance across gender was established and subgroup analyses were conducted. Females in the community sample had higher levels of compassion when compared to males. Female scored higher on all subscales, except indifference, where males had higher levels. Males in the forensic sample had lower levels of compassion towards others than males in the community sample, and lower levels of mindfulness. The indifference subscale was associated with number of diagnoses in the forensic sample. Floor and ceiling effects were not examined.

Summary

The CS-A was rated 5/14. Experts, but not the target population, were consulted on items, and the tolerating element of compassion was not captured. CFA supported the hierarchical higher-order factor of compassion and four subscales, internal consistency was acceptable-good, and interpretability was facilitated. There was some evidence for construct validity, and test–retest reliability and floor and ceiling effects were not examined. Additional evidence is needed to support the hierarchical factor structure identified in the CFA. Alternative models could be explored and, as suggested by Neff et al. (2019), further bifactor analysis could be conducted to clarify the structure. Future studies should specify hypotheses for construct validity testing, to strengthen interpretations of associations with related constructs.

Self-Compassion Scale–Child (SCS-C)

Sutton et al. (2018) developed the SCS-C by modifying the SCS-Short Form (Raes et al., 2011), which has primarily been used with adults. The SCS-C consists of 12-items, rated on a 5-point response scale (ranging from never to always), including two items from each of the six SCS subscales.

Population

The sample included 382 CYP from Canada aged 8–12 (M = 11.3, 50% female, 71% English as first language).

Content Validity

Language of SCS-SF items was altered to be developmentally appropriate, but experts nor recipients were consulted. Items related to 4/5 elements of compassion (not recognizing).

Factor Structure and Reliability

CFA supported the use of a two-factor model as hypothesized, with positively and negatively worded self-compassion items forming two subscales (positive and negative). Internal consistency was good for the two subscales (α = 0.81, 0.83). Test–retest reliability was not assessed.

Construct Validity and Interpretability

Whilst associations were reported to be in expected directions and an aim was to see if the SCS-C related to other constructs in line with previous research, clearly predefined hypotheses were not outlined. The positive subscale was positively associated with self-concept, optimism, empathetic-related responding, prosocial goals, life satisfaction, and mindfulness, and negatively associated with depression and anxiety. The negative subscale was negatively associated with mindfulness, optimism, life satisfaction, and positive affect, and positively associated with negative affect, depression, anxiety, and empathic concern, but not associated with self-concept, perspective- taking, or prosocial goals. Effects were small-large, and three correlations were r ≥ 0.50. Interpretability was not facilitated, and floor and ceiling effects were not examined.

Summary

The SCS-C was rated 4/14 for quality. Experts and recipients were not consulted, and the recognizing element of compassion was not captured. CFA supported the two-factor structure and there was some evidence for construct validity, but predefined hypotheses were not outlined. Internal consistency was good. Few measurement properties were assessed, as test–retest reliability and floor and ceiling effects were not examined, and interpretability was not facilitated.

Shortened Self-Compassion Scale–Adolescence (S-SCS-A)

The S-SCS-A consists of 9 self-report items, on a 5-point response scale (ranging from never to always) and has three subscales (self-kindness, common humanity, mindfulness).

Population

The sample included 132 Dutch adolescents aged 12–17 (M = 14.8, 58% female), from socioeconomically diverse backgrounds (Muris et al., 2016).

Content Validity

S-SCS-A items were modified from the self-kindness, common humanity and mindfulness subscales of the SCS (Neff, 2003b), based on the authors’ critique that including the self-judgment, isolation and overidentification components of compassion inflates the association between compassion and anxiety and depression (also see Muris & Otgaar, 2020; Muris et al., 2024). The modification was guided by three CYP aged 12–15 with varied educational levels. They identified difficult items, and three psychologists modified them. Items related to 4/5 elements of compassion (not recognizing).

Factor Structure and Reliability

PCA yielded three factors (self-kindness, common humanity, mindfulness), with 8/9 items loading most substantially on their intended factor. Neither EFA nor CFA was conducted. Internal consistency was good for the total scale (α = 0.84), and acceptable for subscales (α = 0.74 to 0.79). Test–retest reliability was not examined.

Construct Validity and Interpretability

Vague predictions were made for a link between self-compassion and mental health problems. The S-SCS-A was positively associated with self-esteem and self-efficacy, and negatively associated with anxiety and depression, with small-medium effects. Subscales were positively associated with self-esteem (except common humanity) and self-efficacy, and negatively associated with anxiety (except self- kindness) and depression, with small-large effects. Three correlations were r ≥ 0.50.

Means and standard deviations for total and subscale scores were provided for the total sample, males, and females. No differences for gender or age were found. However, when older (15–17) and younger (13–14) adolescents were analyzed separately, females had lower self-compassion than males in the older sample. Floor and ceiling effects were not assessed.

Summary

The S-SCS-A was rated 4/14 for quality. Experts and the target group were consulted on items, but the recognizing element of compassion was not captured. Whilst PCA yielded three factors, EFA/CFA was not conducted. Internal consistency was good for the total scale and acceptable for subscales, but test–retest was not conducted. Although there was some evidence for construct validity, predefined hypotheses were not outlined. Interpretability was facilitated, but floor and ceiling effects were not assessed.

Self-Compassionate Reactions Scale–Child (SCRS-C)

The SCRS-C consists of four vignettes and 24-items (6 subscales: self-kindness, self-judgment common humanity, isolation, mindfulness, over-identification) informed by Neff’s conceptualisation of self-compassion. Vignettes are written in simple sentences, followed by six items describing possible reactions. Items are rated on a 6-point response scale (ranging from almost never to almost always).

Population

Zhou et al. (2019) piloted the SCRS-C with 161 children (M = 11.49, 36% male), and examined the factor structure with 637 children (M = 10.62, 51% male) and test–retest reliability with a subset of 77 children. All children were Chinese, aged 9–12, and attended school in China. Information about SES was not reported.

Content Validity

To inform vignettes, primary school teachers were interviewed, and children rated the frequency of stressful situations in the classroom. Children were interviewed about vignette situations to help inform items. Experts in compassion research evaluated the meaning of items (consistency with definition of self-compassion) and appropriateness of language for children. Items related to 4/5 elements of compassion (not recognizing).

Factor Structure and Reliability

CFA was conducted but EFA was not conducted beforehand. Whilst items were developed to be consistent with Neff’s (2003a) definition, Zhou et al. (2019) did not make explicit a proposed factor structure for this new scale. A two second-order factor model (compassionate reactions and uncompassionate reactions) with six first-order factors (self-kindness, common humanity, mindfulness, self-judgment, isolation, and over-identification) best fit the data. 0/2 was given for factor structure, as EFA was not conducted, and the factor structure of this newly developed scale was not proposed before CFA.

Internal consistency was good for the total scale (α = 0.81), acceptable-good for second-order subscales (α = 0.78 to 0.81) and unacceptable-acceptable for first-order subscales (α = 0.43 to 0.73). Only the self-kindness subscale reached an acceptable value. Test–retest reliability was poor for the total score (r = 0.46), and questionable for subscales (r = 0.58 to 0.61).

Construct Validity and Interpretability

Predefined hypotheses were not outlined. The SCRS-C total had positive associations with mindfulness, self-acceptance, and self-esteem. The compassionate reactions scale was positively associated with self-esteem but not mindfulness. The uncompassionate reactions scale was negatively associated with self- esteem, self-acceptance, and mindfulness. There were no associations with narcissism, which was used to assess discriminant validity. Overall, effects were small-large, and three correlations were r ≥ 0.50. Interpretability was not facilitated, and floor and ceiling effects were not examined.

Summary

The SCRS-C was rated 2/14 for quality. Experts and the target group were consulted on items, but the recognizing element of compassion was not captured. EFA was not conducted, nor was a factor structure proposed before CFA. CFA supported the use of two second-order factors, with six first-order factors. Test–retest reliability was poor for the total score and questionable for subscales, and internal consistency was good for the total scale, and unacceptable and acceptable for first and second-order subscales, respectively. There was some evidence for construct validity, but predefined hypotheses were not outlined. Interpretability was not facilitated, and floor and ceiling effects were not assessed.

Discussion

The systematic review aimed to describe and critically appraise the psychometric properties of self- and other-compassion measures that were developed and/or validated with children and/or young people (CYP, defined as being younger than 18 years of age). Congruent with a review of adult compassion measures for adults (Strauss et al., 2016), this review of eight measures highlights the lack of psychometrically robust compassion measures for CYP. It also demonstrates increased interest in compassion in CYP in recent years, with more than half of the papers published within the last three years.

Three measures (4 papers) assessed other-compassion and seven (18 papers) assessed self-compassion. The review found that the SCS and its adaptations (SCS-Y, SCS-C, S-SCS-A) also dominate the measurement of self-compassion in CYP. The SCS and other measures did not fully capture the consensual definition of compassion by Strauss et al. (2016) and many papers lacked consultation on items with both experts and recipients, contributing to poor content validity, with no measure scoring the maximum score of 2 for this criterion. Arguably, poor content validity is the most problematic of all criteria. A measure could be rated highly for all other psychometric properties, but if it fails to adequately capture the full breadth of the construct being evaluated, the value of the measure in research and practice is questionable. Therefore, psychometric evaluations of measures for CYP that would be rated highly (i.e. 2) for content validity are needed.

CFA was conducted for seven measures, but mixed evidence emerged for the factor structure of measures based on the SCS, apart from the SCS-Y, reflecting inconsistencies reported in the adult literature (Muris & Petrocchi, 2017; Muris & Otgaar, 2020), suggesting poor validity. Muris and colleagues (e.g. Muris et al., 2021, 2022) argue that including the (reduced) uncompassionate self-responding subscales inflates the relationship of the SCS with internalising symptoms, complicating the exploration of the protective role of compassion. It is also questionable whether lack of uncompassionate responding to the self is equivalent to being self-compassionate. Neff et al. (2019) refuted these points however, reporting empirical support for the use of a total score in adults (and six subscale scores when assessing mechanisms) rather than separate scores for uncompassionate and compassionate self-responding, based on their analysis of 20 (adult) study samples. In the current review, studies into the factor structure of the SCS-Y were largely aligned with this. However, some studies also found good evidence for the two-bifactor model, aligning with the concerns raised by Muris et al.and colleagues (2024). Therefore, as in research with adults, further research is needed to clarify the factor structure of the SCS as adapted for CYP.

Internal consistency was the strongest criterion. However, Cronbach’s alpha was unacceptable for some measures. Only five papers assessed test–retest reliability. All but two were rated zero, questioning the reliability over time of current measures.

Correlations with related constructs were reported for all measures. Three measures met the full criteria for construct validity. Whilst many correlations were weak and some findings inconsistent, correlations were generally significant, medium-large, in expected directions, and suggestive of the importance of self-compassion for wellbeing (Marsh et al., 2018). Other-compassion also had associations suggestive of benefits for wellbeing, but fewer papers examined this. This suggests that offering CBIs to CYP may offer the possibility of improving wellbeing and mental health outcomes. Few papers examined the relationship between other- and self-compassion. Findings were mixed, with small-large associations found (Cunha et al., 2023; Sousa et al., 2022). This suggests more limited understanding of this relationship in CYP and the need for further research.

No paper explicitly examined floor and ceiling effects. Nine papers did not aid interpretability of scores. At least one subgroup analysis was conducted for five measures, with gender the most common comparison. Subgroup analyses generally indicated that females had lower levels of self-compassion than males, but higher levels of compassion for others, supporting research with adults (Yarnell et al., 2015). There was also indication of a trend for self-compassion to decrease with age, particularly for females. Few measures had subgroup analyses conducted for multiple groups to aid interpretability.

The highest rated measure overall was the CEAS-A (9/14), followed by the SCS-Y (8), SCS, and CS-C (both 7/14). None of the measures, however, captured the five elements of compassion or explicitly examined floor and ceiling effects. CFA (and ESEM) was conducted for the CEAS, SCS-Y and CS-C, supporting factor structures, and construct validity was good. However, there was mixed evidence for the SCS factor structure, and inconsistent findings for construct validity. The CEAS-A was the only measure to fully meet criteria for test–retest reliability and the SCS and SCS-Y did not meet full criteria for internal consistency, due to inconsistent findings. However, the CEAS-A, in addition to the SCS, lacked consultation with experts and CYP. The SCS and SCS-Y also focus on self-compassion, meaning they are not suitable to assess compassion towards others.

Identified papers used samples from 11 different countries (Portugal, US, Finland, Netherlands, China, Canada, Turkey, Iran, Greece, Germany, Sweden). The majority of papers did not report socio-economic status or ethnicity of participants. However, of those that did, samples were generally diverse. Lack of consultation with CYP for items is particularly problematic for measures developed for use with adults (e.g., SCS), as it cannot be assumed the language used is developmentally appropriate and this contributes to low ratings for content validity for these measures. Whilst some measures were validated in different countries, with items translated to a different language, recipients/experts were not consulted on the translated versions in most included papers. This therefore poses a risk of items not being suitable for these populations.

One measure assessed compassion towards other living things, which does not fully align with the definition of Strauss et al. (2016) of compassion as being related to “human suffering” and feeling for the “person”. However, given the current climate emergency, this may be an important area for future research.

Some papers conducted PCA and did not conduct EFA or CFA. However, PCA has a greater emphasis on data reduction than interpretation. EFA is recommended when research aims to identify underlying factors, and CFA is recommended to test whether a set of observed variables represent underlying factors (Alavi et al., 2020). Neff et al. (2019) recommended bifactor ESEM to assess the factor structure of the SCS given its multidimensionality, however, as noted above, the inclusion of uncompassionate self-responding items has been questioned for both theoretical and empirical reasons (e.g. see Muris et al., 2024).

Whilst Pearson and Spearman correlation coefficients were reported in two papers for test–retest reliability, these do not take into account systematic differences. The intraclass correlation coefficient (ICC) is reported to be a more suitable measure of reliability as it reflects the degree of correlation and agreement between measurements, taking into account systematic differences (Koo & Li, 2016; Terwee et al., 2007). However, only three (Cunha et al., 2023; Henje et al., 2024; Nazari et al., 2022) papers reported the ICC.

Many papers did not outline predefined hypotheses for construct validity. Where hypotheses were made, these were generally about the direction of correlations (positive or negative), but not about the strength of correlations. However, hypotheses should be as specific as possible (Terwee et al., 2007) as without, there is a high risk of bias as it can be tempting to give alternative explanations for low correlations, instead of concluding the measure has poor validity.

Strengths Limitations and Future Research

A strength of this review is that a broad, stringent and replicable approach was taken to assessing psychometric quality, allowing for comparison of quality across measures and between measures used with CYP and measures used with adults. Whilst a range of definitions of compassion exist, the Strauss et al. (2016) definition was used in this study as it aligns with theories of compassion, was developed to reach consensus across multiple definitions and is supported by factor analysis of items from multiple measures of compassion (Gu et al., 2017). Ratings of content validity are dependent on the adopted definition of compassion. However, this did not affect the rank order of measure quality or alter the outcome that none of the measures obtained the maximum quality score.

This review highlighted the need for valid and reliable measures to assess both self-compassion, and compassion towards others, and for the psychometric properties to be examined across a diverse range of CYP, including CYP from different socio-economic, cultural/language and ethnic backgrounds. Existing measures (including following translation, where applicable) should be revised through thorough consultation with both experts and young recipients during item development/translation, as a critical step towards enhancing content validity, ensuring that items are culturally, linguistically and developmentally appropriate for CYP across diverse backgrounds. Questionnaire items should align with a comprehensive definition of compassion, such as the framework by Strauss et al. (2016), to capture the multidimensional nature of compassion adequately and therefore to ensure strong content validity (arguably, as noted above, the most critical criterion). Gu et al. (2020) developed the Sussex-Oxford Compassion Scales (SOCS) with adults to align with the definition of compassion by Strauss et al. (2016), and this measure has strong psychometric properties in adults (Gu et al., 2020). Muris et al. (2022) advocated for the use of measures such as the SOCS (Gu et al., 2020) with CYP to advance understanding of compassion as a protective construct, given limitations they highlight with regard to the SCS and its derivatives, the currently most widely used measure of compassion with CYP. However, the psychometric properties of the SOCS with CYP are as yet unexamined, with the exception of establishing internal consistency (Muris et al., 2022), and this is an important avenue for future research.

A more rigorous approach to confirmatory factor analysis (CFA) is needed to address multicollinearity, particularly in factor models with high intercorrelations between subscales, to ensure that constructs are distinct and not redundant. Exploratory factor analysis (EFA) could be reported followed by CFA (with data from independent samples contributing to each) when developing new measures or validating existing ones, to ensure that underlying factors are identified and assessed using both bottom-up and top-down approaches.

Future studies should detail hypotheses for construct validity, specifying the strength and direction of correlations, to mitigate bias and support more accurate assessments of construct validity. Test–retest reliability should be evaluated, using intraclass correlation coefficients (ICC), to account for systematic differences, to offer insight into the stability of compassion measures over time, particularly during adolescence. More comprehensive reporting of internal consistency of scales/subscales (Cronbach's alpha) is essential to examine reliability across various populations and settings.

Future studies should also conduct subgroup analyses across multiple demographic groups (e.g., gender, age, socio-economic status) and report the necessary details, such as means, standard deviations, and subgroup analyses, to enhance the interpretability and replicability of findings and provide insights into how compassion measures function across diverse populations of CYP. Consistent reporting of socio-economic status and ethnicity of participants is also recommended to better understand of how these factors might be associated with compassion.

This review underscores the importance of examining and reporting floor and ceiling effects, to understand the range and distribution of compassion scores, ensuring measures are sensitive across the entire range of possible scores, aiding the interpretation of results. Measures (including when translated) should be validated across different cultural contexts and languages, with thorough consultation with local experts and recipients to ensure cultural and linguistic relevance and appropriateness.

Whilst poor test–retest reliability could be seen to undermine the reliability of measures of compassion reviewed here, this may also suggest that compassion is less stable during development and in adulthood. Therefore, future studies should explore developmental aspects of compassion among CYP, including its stability over time and the relationship between self-compassion and other-compassion. Evaluating the psychometric properties of measures such as the SOCS, which measure both self-compassion and other-compassion with potentially stronger content validity than in the measures reviewed here, in CYP could aid this understanding. This exploration could inform interventions aimed at promoting compassion and well-being.

Declarations

Conflict of Interest

CS is co-lead for Sussex Mindfulness Centre and has received funding from the National Institute of Health Research and from the Economic and Social Research Council (ESRC) to evaluate and develop mindfulness and compassion-based interventions and measures. She is the lead author on the Strauss et al. (2016) review and a co-author on the Gu et al. (2017, 2020) articles referenced in the current paper. TL has received ESRC funding to develop and evaluate mindfulness-based interventions. JH declares that they have no conflicts of interest.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Onze productaanbevelingen

BSL Psychologie Totaal

Met BSL Psychologie Totaal blijf je als professional steeds op de hoogte van de nieuwste ontwikkelingen binnen jouw vak. Met het online abonnement heb je toegang tot een groot aantal boeken, protocollen, vaktijdschriften en e-learnings op het gebied van psychologie en psychiatrie. Zo kun je op je gemak en wanneer het jou het beste uitkomt verdiepen in jouw vakgebied.

Meer informatie

BSL Academy Accare GGZ collective

Meer informatie

BSL GOP_opleiding GZ-psycholoog

Meer informatie

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 60 KB)

Abela, J. R. Z., & Taxel, E. (2001). Vulnerability to specific life events: Personality predispositions to depression in schoolchildren. Paper presented at the meeting of the Canadian Psychological Association.

Alavi, M., Visentin, D. C., Thapa, D. K., Hunt, G. E., Watson, R., & Cleary, M. (2020). Exploratory factor analysis and principal component analysis in clinical studies: Which one should you use. Journal of Advanced Nursing, 76(8), 1886–1889. https://doi.org/10.1111/jan.14377CrossRefPubMed

Barker, C., Pistrang, N., & Elliott, R. (2002). Research methods in clinical psychology: An introduction for students and practitioners. Wiley.CrossRef

Beck, J. S., Beck, T. A., & Jolly, B. J. (2001). Manual for the Beck Youth Inventories of Emotional and Social Impairment. The Psychological Corporation.

Bengtsson, H., Söderström, M., & Terjestam, Y. (2016). The structure and development of dispositional compassion in early adolescence. The Journal of Early Adolescence, 36(6), 840–873. https://doi.org/10.1177/0272431615594461CrossRef

Bijstra, J. O., Jackson, S., & Bosma, H. A. (1994). De Utrechtse Coping Lijst voor Adolescenten (The Utrecht Coping List for Adolescents). Kind Enōsis Adolescent, 15(67), 98–109. https://doi.org/10.1007/BF03060546CrossRef

Birmaher, B., Khetarpal, S., Brent, D., Cully, M., Balach, L., Kaufman, J., & Neer, S. M. (1997). The screen for child anxiety related emotional disorders (SCARED): Scale construction and psychometric characteristics. Journal of the American Academy of Child & Adolescent Psychiatry, 36(4), 545–553. https://doi.org/10.1097/00004583-199704000-00018CrossRef

Blom, E. H., Bech, P., Hogberg, G., Larsson, J. O., & Serlachius, E. (2012). Screening for depressed mood in an adolescent psychiatric context by brief self-assessment scales - testing psychometric validity of WHO-5 and BDI-6 indices by latent trait analyses. Health Quality of Life Outcomes, 10(1), 149. https://doi.org/10.1186/1477-7525-10-149CrossRefPubMedPubMedCentral

Boateng, G. O., Neilands, T. B., Frongillo, E. A., Melgar-Quiñonez, H. R., & Young, S. L. (2018). Best practices for developing and validating scales for health, social, and behavioral research: A primer. Frontiers in Public Health, 6, 149. https://doi.org/10.3389/fpubh.2018.00149

Bluth, K., & Clepper-Faith, M. (2023). Self-compassion in adolescence. In A. Finlay-Jones, K. Bluth, & K. Neff (Eds.), Handbook of self-compassion (pp. 89–107). Springer Nature Switzerland AG. https://doi.org/10.1007/978-3-031-22348-8_6

Bluth, K., Campo, R. A., Futch, W. S., & Pearson, J. L. (2017). Age and gender differences in the associations of self-compassion and emotional well-being in a large adolescent sample. Journal of Youth and Adolescence, 46(4), 840–853. https://doi.org/10.1007/s10964-016-0567-2CrossRefPubMed

Campbell-Sills, L., & Stein, M. B. (2007). Psychometric analysis and refinement of the Connor-Davidson Resilience Scale (CD-RISC): Validation of a 10-item measure of resilience. Journal of Traumatic Stress, 20(6), 1019–1028. https://doi.org/10.1002/jts.20271CrossRefPubMed

Pinto-gouveia, J., Cunha, M., Xavier, A., & Castilho, P. (2011). Escala de AutoCompaixão para Adolescentes [Self-Compassion Scale for Adolescents]. Manuscript in preparatio

Cella, D., Gershon, R., Lai, J. S., & Choi, S. (2007). The future of outcomes measurement: Item banking, tailored short-forms, and computerized adaptive assessment. Quality of Life Research, 16(1), 133–141. https://doi.org/10.1007/s11136-007-9204-6CrossRefPubMed

Cheung, H. N., Ho, W. S., Habibi Asgarabad, M., Chan, S. W. Y., & Williams, J. (2023). A multiple indicator multiple cause (MIMIC) model of the Self-Compassion Scale Youth (SCS-Y) and investigation of differential item functioning in China. Hong Kong and UK Adolescents. Mindfulness, 14(8), 1967–1979. https://doi.org/10.1007/s12671-023-02170-zCrossRef

Chorpita, B. F., Moffitt, C. E., & Gray, J. (2005). Psychometric properties of the revised child anxiety and depression scale in a clinical sample. Behaviour Research and Therapy, 43(3), 309–322. https://doi.org/10.1016/j.brat.2004.02.004CrossRefPubMed

Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences. Routledge.

Compassion in Education Foundation. (2016). The Fourth Strategic Plan: ‘The Pen is Mightier than the Sword’ 2018–2020. https://www.coedfoundation.org.uk/pdfs/CoEd_Foundation_strategic_plan.pdf

Cong, Z., & Gao, W. F. (1999). The development of self-acceptance questionnaire and the test of its reliability and validity. Chinese Journal of Behavioral Medical Science, 8(1), 20–22.

Coroiu, A., Meyer, A., Gomez-Garibello, C. A., Brahler, E., Hessel, A., & Korner, A. (2015). Brief form of the Interpersonal Competence Questionnaire (ICQ-15): Development and preliminary validation with a German population sample. European Journal of Psychological Assessment, 31(4), 272–279. https://doi.org/10.1027/1015-5759/a000234CrossRef

Cruz, D., Narciso, I., Pereira, C. R., & Sampaio, D. (2014). A Short Form of the Portuguese Version of the Youth Self-Report. Journal of Child and Family Studies, 23(6), 1114–1127. https://doi.org/10.1007/s10826-013-9770-xCrossRef

Cunha, M., Xavier, A., & Vitória, I. (2013). Avaliação da auto-compaixão em adolescentes: Adaptação e qualidades psicométricas da Escala de Auto-Compaixão [Assessment of self-compassion in adolescents: Adaptation and psychometric properties of the self compassion scale]. Journal of Child and Adolescent Psychology, 4, 95–117.

Cunha, M., Xavier, A., Martinho, M. I., & Matos, M. (2014). Measuring positive emotional memories in adolescents: Psychometric properties and confirmatory factor analysis of the Early Memories of Warmth and Safeness Scale. International Journal of Psychology and Psychological Therapy, 14, 245–259. https://www.redalyc.org/pdf/560/56031293007.pdf.

Cunha, M., Xavier, A., Cherpe, S., & Pinto-Gouveia, J. (2015). Avaliação da Vergonha em Adolescentes: “The Other as Shamer Scale.” Psicologia: Teoria e Pesquisa (Brasília), 33, 1–19.

Cunha, M., Xavier, A., & Castilho, P. (2016). Understanding self-compassion in adolescents: Validation study of the Self-Compassion Scale. Personality and Individual Differences, 93, 56–62. https://doi.org/10.1016/j.paid.2015.09.023CrossRef

Cunha, M., Galhardo, A., Gilbert, P., Rodrigues, C., & Matos, M. (2023). The flows of compassion in adolescents as measured by the compassionate engagement and action scales. Current Psychology, 42, 7737–7751. https://doi.org/10.1007/s12144-021-02097-5. Advance online publication.CrossRef

Davis, M. H. (1983). Measuring individual differences in empathy: Evidence for a multidimensional approach. Journal of Personality and Social Psychology, 44(1), 113–126. https://doi.org/10.1037/0022-3514.44.1.113CrossRef

Deniz, M., Satici, S. A., Doenyas, C., & Caglar, A. (2022). Self-compassion scale for youth: Turkish adaptation and exploration of the relationship with resilience, depression, and well-being. Child Indicators Research, 15(4), 1255–1267. https://doi.org/10.1007/s12187-022-09915-7CrossRefPubMedPubMedCentral

Department of Health. (2013). NHS Constitution for England. https://www.gov.uk/government/publications/the-nhs-constitution-for-england#history

Diener, E., Wirtz, D., Tov, W., Kim-Prieto, C., Choi, D.-W., Oishi, S., & Biswas-Diener, R. (2010). New well-being measures: Short scales to assess flourishing and positive and negative feelings. Social Indicators Research, 97(2), 143–156. https://doi.org/10.1007/s11205-009-9493-yCrossRef

Eisenberg, N. (2000). Emotion, regulation, and moral development. Annual Review of Psychology, 51, 665–697. https://doi.org/10.1146/annurev.psych.51.1.665CrossRefPubMed

Eisenberg, N. (2017). Altruism and prosocial behavior in childhood and adolescence. In J. E. Cassel & E. L. Worthington (Eds.), Handbook of compassion science (pp. 61–78). Guilford Press.

Ey, S., Hadley, W., Allen, D. N., Palmer, S., Klosky, J., Deptula, D., Thomas, J., & Cohen, R. (2005). A new measure of children’s optimism and pessimism: The youth life orientation test. Journal of Child Psychology and Psychiatry, 46(5), 548–558. https://doi.org/10.1111/j.1469-7610.2004.00372.xCrossRefPubMed

Faulstich, M. E., Carey, M. P., Ruggiero, L., Enyart, P., & Gresham, F. (1986). Assessment of depression in childhood and adolescence: An evaluation of the Center for Epidemiological Studies Depression Scale for Children (CES-DC). The American Journal of Psychiatry, 143(8), 1024–1027. https://doi.org/10.1176/ajp.143.8.1024CrossRefPubMed

Feldman, C., & Kuyken, W. (2011). Compassion in the landscape of suffering. Contemporary Buddhism, 12(1), 143–155. https://doi.org/10.1080/14639947.2011.564831CrossRef

Ferrari, M., Hunt, C., Harrysunker, A., Abbott, M. J., Beath, A. P., & Einstein, D. A. (2019). Self- compassion interventions and psychosocial outcomes: A meta-analysis of RCTs. Mindfulness, 10(8), 1455–1473. https://doi.org/10.1007/s12671-019-01134-6CrossRef

Gadermann, A. M. (2009). The satisfaction with life scale adapted for children: Investigating the structural, external, and substantive aspects of construct validity [Doctoral dissertation, University of British Columbia. https://open.library.ubc.ca/soa/cIRcle/collections/ubctheses/24/items/1.0054573

George, D., & Mallery, P. (2003). SPSS for Windows step by step: A simple guide and reference,11.0 update (4th ed.). Pearson.

Gilbert, P. (2014). The origins and nature of compassion focused therapy. British Journal of Clinical Psychology, 53(1), 6–41. https://doi.org/10.1111/bjc.12043CrossRefPubMed

Gilbert, P. (2015). The evolution and social dynamics of compassion. Social and Personality Psychology Compass, 9(6), 239–254. https://doi.org/10.1111/spc3.12176CrossRef

Gilbert, P., & van Gordon, W. (2023). Compassion as a skill: A comparison of contemplative and evolution-based approaches. Mindfulness, 14(10), 2395–2416. https://doi.org/10.1007/S12671-023-02173-W/FIGURES/4CrossRef

Gilbert, P., Clarke, M., Hempel, S., Miles, J. N., & Irons, C. (2004). Criticizing and reassuring oneself: An exploration of forms, styles, and reasons in female students. Journal of Clinical Psychology, 43(1), 31–50. https://doi.org/10.1348/014466504772812959CrossRef

Gilbert, P., Catarino, F., Duarte, C., et al. (2017). The development of compassionate engagement and action scales for self and others. Journal of Compassionate Health Care, 4, 4. https://doi.org/10.1186/s40639-017-0033-3

Goetz, J. L., Keltner, D., & Simon-Thomas, E. (2010). Compassion: An evolutionary analysis and empirical review. Psychological Bulletin, 136(3), 351–374. https://doi.org/10.1037/a0018807CrossRefPubMedPubMedCentral

Goodman, R. (2001). Psychometric properties of the strengths and difficulties questionnaire. The Journal of the American Academy of Child & Adolescent Psychiatry, 40(11), 1337–1345. https://doi.org/10.1097/00004583-200111000-00015CrossRef

Greco, L. A., Baer, R. A., & Smith, G. T. (2011). Assessing mindfulness in children and adolescents: Development and validation of the Child and Adolescent Mindfulness Measure (CAMM). Psychological Assessment, 23(3), 606–614. https://doi.org/10.1037/a0022819

Gruber, E., Baumann, I., Vonderlin, E., & Kater, L. (2023). Translation and factorial validation of the Self-Compassion Scale for Adolescents in a German community sample. Journal of Child and Family Studies, 32, 1318–1331. https://doi.org/10.1007/s10826-023-02569-1CrossRef

Gu, J., Cavanagh, K., Baer, R., & Strauss, C. (2017). An empirical examination of the factor structure of compassion. PLoS ONE, 12(2), e0172471. https://doi.org/10.1371/journal.pone.0172471

Gu, J., Baer, R., Cavanagh, K., Kuyken, W., & Strauss, C. (2020). Development and psychometric properties of the Sussex-Oxford Compassion Scales (SOCS). Assessment, 27(1), 3–20. https://doi.org/10.1177/1073191119860911CrossRefPubMed

Gullone, E., & Taffe, J. (2012). The Emotion Regulation Questionnaire for Children and Adolescents (ERQ–CA): A psychometric evaluation. Psychological Assessment, 24(2), 409–417. https://doi.org/10.1037/a0025777CrossRefPubMed

Harter, S. (1985). Manual for the self-perception profile for children. University of Denver.

Hatchel, T., Merrin, G. J., & Espelage, A. D. (2019). Peer victimization and suicidality among LGBTQ youth: The roles of school belonging, self-compassion, and parental support. Journal of LGBT Youth, 16(2), 134–156. https://doi.org/10.1080/19361653.2018.1543036CrossRef

Heidary, S., Abdollahi, A., Jarrahi, E., & Ahmed, A. A. A. (2022). Psychometric Assessment of the Persian Version of Compassion Scale for Adolescents. Evaluation & the Health Professions, 45(3), 297–302. https://doi.org/10.1177/01632787211053855CrossRef

Henje, E., Wiberg, M., & Dennhag, I. (2024). Translation and validation of the Swedish version of the Self-Compassion Scale for Youth. Australian Psychologist, 59(1), 24–35. https://doi.org/10.1080/00050067.2023.2217326

Huang, L., Chen, Z., Jiang, W., & Liu, Q. (2022). Validation of the Chinese version of Self-Compassion Scale for Youth (SCS-Y). Mindfulness, 13(12), 3166–3178. https://doi.org/10.1007/s12671-022-02024-0CrossRef

Jiang, Y., You, J., Hou, Y., Du, C., Lin, M. P., Zheng, X., & Ma, C. (2016). Buffering the effects of peer victimization on adolescent non-suicidal self-injury: The role of self-compassion and family cohesion. Journal of Adolescence, 53, 107–115. https://doi.org/10.1016/j.adolescence.2016.09.005CrossRefPubMed

Jonason, P. K., & Webster, G. D. (2010). The dirty dozen: A concise measure of the dark triad. Psychological Assessment, 22(2), 420–432. https://doi.org/10.1037/a0019265CrossRefPubMed

Kaltiala-Heino, R., Rimpelä, M., Rantanen, P., & Laippala, P. (1999). Finnish modification of the 13-item Beck Depression Inventory in screening an adolescent population for depressiveness and positive mood. Nordic Journal of Psychiatry, 53(6), 451–457. https://doi.org/10.1080/080394899427700CrossRef

Kappelmayer, M., Czar, A., Tresca, M., D’Adamo, P., & Lozada, M. (2022). A school intervention promotes compassion, empathy and social relationships in children. School Psychology International, 44(5), 515–532. https://doi.org/10.1177/01430343221145668

Karakasidou, E., Raftopoulou, G., Pezirkianidis, C., & Stalikas, A. (2021). Validity, reliability, and factorial structure of the Self Compassion Scale-Youth version in the Greek population. Psychology, 12(4), 536–553. https://doi.org/10.4236/psych.2021.124033CrossRef

Kaufman, E. A., Xia, M., Fosco, G., Yaptangco, M., Skidmore, C. R., & Crowell, S. E. (2016). The Difficulties in Emotion Regulation Scale Short Form (DERS-SF): Validation and replication in adolescent and adult samples. Journal of Psychopathology and Behavioral Assessment, 38(3), 443–455. https://doi.org/10.1007/s10862-015-9529-3CrossRef

Khanjani, M., Shahidi, S., Fathabadi, J., Mazaheri, M., & Shokri, O. (2014). Factor structure and psychometric properties of the Ryff’s scale of psychological well-being, short form (18-item) among male and female students. Thoughts and Behavior in Clinical Psychology, 9(32), 27–36. https://jtbcp.riau.ac.ir/article_67_en.html.

Kirby, J. N., Tellegen, C. L., & Steindl, S. R. (2017). A meta-analysis of compassion-based interventions: Current state of knowledge and future directions. Behavior Therapy, 48(6), 778–792. https://doi.org/10.1016/j.beth.2017.06.003CrossRefPubMed

Koo, T. K., & Li, M. Y. (2016). A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine, 15(2), 155–163. https://doi.org/10.1016/j.jcm.2016.02.012CrossRefPubMedPubMedCentral

Koopmann-Holm, B., Bruchmann, K., Fuchs, M., & Pearson, M. (2021). What constitutes a compassionate response? The Important Role of Culture. Emotion, 21(8), 1610–1624. https://doi.org/10.1037/emo0001007CrossRefPubMed

Kovacs, M. (1981). Rating scales to assess depression in school-aged children. Acta Paedo-Psychiatrica: International Journal of Child & Adolescent Psychiatry, 46(5–6), 305–315.

Kroenke, K., Spitzer, R. L., & Williams, J. B. W. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606–613. https://doi.org/10.1046/j.1525-1497.2001.016009606.xCrossRefPubMedPubMedCentral

Kroenke, K., Spitzer, R. L., & Williams, J. B. (2003). The Patient Health Questionnaire-2: Validity of a two-item depression screener. Medical Care, 41(11), 1284–1292. https://doi.org/10.1097/01.MLR.0000093487.78664.3CCrossRefPubMed

Kumlander, S., Lahtinen, O., Turunen, T., & Salmivalli, C. (2018). Two is more valid than one, but is six even better? The factor structure of the Self-Compassion Scale (SCS). PLoS ONE, 13(12), e0207706. https://doi.org/10.1371/journal.pone.0207706CrossRefPubMedPubMedCentral

Kusche, C. A., Greenberg, M. T., & Beilke, R. (1988). Seattle personality questionnaire for young schoolaged children. Unpublished personality questionnaire. Seattle.

Landis, J. R., & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159–174. https://doi.org/10.2307/2F2529310CrossRefPubMed

Lathren, C. R., Rao, S. S., & Park, J. (2021). Self-compassion and current close interpersonal relationships: A scoping literature review. Mindfulness, 12(5), 1078–1093. https://doi.org/10.1007/s12671-020-01566-5CrossRefPubMedPubMedCentral

Laurent, J., Catanzaro, S. J., Joiner, T. E., Jr., Rudolph, K. D., Potter, K. I., Lambert, S., Osborne, L., & Gathright, T. (1999). A measure of positive and negative affect for children: Scale development and preliminary validation. Psychological Assessment, 11(3), 326–338. https://doi.org/10.1037/1040-3590.11.3.326CrossRef

Lawlor, M. S., Schonert-Reichl, K. A., Gadermann, A. M., & Zumbo, B. D. (2014). A validation study of the Mindful Attention Awareness Scale adapted for children. Mindfulness, 5(6), 730–741. https://doi.org/10.1007/s12671-013-0228-4CrossRef

Liao, X., Wang, J., Zhang, F., Luo, Z., Zeng, Y., & Wang, G. (2022). The levels and related factors ofcompassion fatigue and compassion satisfaction among family caregivers: A systematic review and meta-analysis of observational studies. Geriatric Nursing, 45, 1–8. https://doi.org/10.1016/j.gerinurse.2022.02.016CrossRefPubMed

Lovibond, P. F., & Lovibond, S. H. (1995). The structure of negative emotional states: Comparison of the Depression Anxiety Stress Scales (DASS) with the Beck Depression and Anxiety Inventories. Behaviour Research and Therapy, 33(3), 335–343. https://doi.org/10.1016/0005-7967(94)00075-U

Lyubomirsky, S., & Lepper, H. S. (1999). A measure of subjective happiness: Preliminary reliability and construct validation. Social Indicators Research, 46(2), 137–155. https://doi.org/10.1023/A:1006824100041CrossRef

Matos, M., Albuquerque, I., Galhardo, A., Cunha, M., Pedroso Lima, M., Palmeira, L., Petrocchi, N., McEwan, K., Maratos, F. A., & Gilbert, P. (2022). Nurturing compassion in schools: A randomized controlled trial of the effectiveness of a compassionate mind training program for teachers. PLoS ONE, 17(3), 0263480. https://doi.org/10.1371/journal.pone.0263480

Marques, S. C., Pais-Ribeiro, J. L., & Lopez, S. J. (2007). Validation of a Portuguese version of the Students’ Life Satisfaction Scale. Applied Research in Quality of Life, 2(2), 83–94. https://doi.org/10.1007/s11482-007-9031-5CrossRef

Marsh, H. W. (1988). The self-description questionnaire (SDQ): A theoretical and empirical basis for the measurement of multiple dimensions of preadolescent self-concept: A test manual and a research monograph. The Psychological Corporation.

Marsh, I. C., Chan, S. W., & MacBeth, A. (2018). Self-compassion and psychological distress in adolescents - a meta-analysis. Mindfulness, 9(4), 1011–1027. https://doi.org/10.1007/s12671-017-0850-7CrossRefPubMed

Mattejat, F., & Remschmidt, H. (2006). ILK Inventar zur Erfassung der Lebensqualität bei Kindern und Jugendlichen. Huber.

Midgley, C., Kaplan, A., Middleton, M., Maehr, M. L., Urdan, T., Anderman, L. H., Anderman, E., & Roeser, R. (1998). The development and validation of scales assessing students’ achievement goal orientations. Contemporary Educational Psychology, 23(2), 113–131. https://doi.org/10.1006/ceps.1998.0965CrossRefPubMed

Morin, A. J. S., Marsh, H. W., & Nagengast, B. (2013). Exploratory structural equation modeling. In G. R. Hancock & R. O. Mueller (Eds.), Structural equation modeling: A second course (pp. 395–436). IAP Information Age Publishing.

Morin, A. J., Arens, A. K., & Marsh, H. W. (2016). A bifactor exploratory structural equation modeling framework for the identification of distinct sources of construct-relevant psychometric multidimensionality. Structural Equation Modeling, 23(1), 116–139. https://doi.org/10.1080/10705511.2014.961800

Munn, Z., Stern, C., Aromataris, E., Lockwood, C., & Jordan, Z. (2018). What kind of systematic review should I conduct? A proposed typology and guidance for systematic reviewers in the medical and health sciences. BMC Medical Research Methodology, 18, 5. https://doi.org/10.1186/s12874-017-0468-4

Muris, P. (2001). A brief questionnaire for measuring self-efficacy in youths. Journal of Psychopathology and Behavioral Assessment, 23(3), 145–149. https://doi.org/10.1023/A:1010961119608CrossRef

Muris, P., & Otgaar, H. (2020). The process of science: A critical evaluation of more than 15 years of research on self-compassion with the Self-Compassion Scale. Mindfulness, 11(6), 1469–1482. https://doi.org/10.1007/s12671-020-01363-0CrossRef

Muris, P., Meesters, C., Pierik, A., & de Kock, B. (2016). Good for the self: Self-compassion and other self-related constructs in relation to symptoms of anxiety and depression in non-clinical youths. Journal of Child and Family Studies, 25(2), 607–617. https://doi.org/10.1007/s10826-015-0235-2CrossRefPubMed

Muris, P., & Petrocchi, N. (2017). Protection or vulnerability? A meta-analysis of the relations between the positive and negative components of self-compassion and psychopathology. Clinical Psychology & Psychotherapy, 24(2), 373–383. https://doi.org/10.1002/cpp.2005

Muris, P., van den Broek, M., Otgaar, H., Oudenhoven, I., & Lennartz, J. (2018). Good and bad sides of self-compassion: A face validity check of the self-compassion scale and an investigation of its relations to coping and emotional symptoms in non-clinical adolescents. Journal of Child and Family Studies, 27(8), 2411–2421. https://doi.org/10.1007/s10826-018-1099-zCrossRefPubMedPubMedCentral

Muris, P., Otgaar, H., López, A., Kurtic, I., & Van de Laar, I. (2021). The (non)protective role of self-compassion in internalizing symptoms: Two empirical studies in adolescents demonstrating unwanted effects of using the Self-Compassion Scale total score. Mindfulness, 12(1), 240–252. https://doi.org/10.1007/s12671-020-01514-3

Muris, P., Bongers, K., Schenning, C., Meesters, C., & Otgaar, H. (2022). Self-compassion correlates of anxiety and depression symptoms in youth: A comparison of two self-compassion measures. Children, 9(12), 1930. https://doi.org/10.3390/children9121930CrossRefPubMedPubMedCentral

Muris, P., Fernández-Martínez, I., & Otgaar, H. (2024). On the edge of psychopathology: Strong relations between reversed self-compassion and symptoms of anxiety and depression in young people. Clinical Child and Family Psychology Review, 27(2), 407–423. https://doi.org/10.1007/s10567-024-00471-wCrossRefPubMedPubMedCentral

Nas, E., & Sak, R. (2021). A new compassion scale for children and its psychometric properties. Child Psychiatry & Human Development, 52(3), 477–487. https://doi.org/10.1007/s10578-020-01034-0CrossRef

Nazari, N., Hernández, R. M., Ocaña-Fernandez, Y., & Griffiths, M. D. (2022). Psychometric validation of the Persian Self-Compassion Scale youth version. Mindfulness, 13(2), 385–397. https://doi.org/10.1007/s12671-021-01801-7CrossRefPubMedPubMedCentral

Neff, K. (2003a). Self-compassion: An alternative conceptualization of a healthy attitude toward oneself. Self and Identity, 2(2), 85–101. https://doi.org/10.1080/15298860309032CrossRef

Neff, K. D. (2003b). The development and validation of a scale to measure self-compassion. Self and Identity, 2(3), 223–250. https://doi.org/10.1080/15298860309027CrossRef

Neff, K. D. (2020). Commentary on Muris and Otgaar (2020): Let the empirical evidence speak on the Self-Compassion Scale. Mindfulness, 11(8), 1900–1909. https://doi.org/10.1007/s12671-020-01411-9

Neff, K. D. (2023). Self-compassion: Theory, method, research, and intervention. Annual Review of Psychology, 74, 193–218. https://doi.org/10.1146/annurev-psych-032420-031047CrossRefPubMed

Neff, K. D., & McGehee, P. (2010). Self-compassion and psychological resilience among adolescents and young adults. Self and Identity, 9(3), 225–240. https://doi.org/10.1080/15298860902979307CrossRef

Neff, K. D., Tóth-Király, I., Yarnell, L. M., Arimitsu, K., Castilho, P., Ghorbani, N., Guo, H. X., Hirsch, J. K., Hupfeld, J., Hutz, C. S., Kotsou, I., Lee, W. K., Montero-Marin, J., Sirois, F. M., de Souza, L. K., Svendsen, J. L., Wilkinson, R. B., & Mantzios, M. (2019). Examining the factor structure of the Self-Compassion Scale in 20 diverse samples: Support for use of a total score and six subscale scores. Psychological Assessment, 31(1), 27–45. https://doi.org/10.1037/pas0000629CrossRefPubMed

Neff, K. D., Bluth, K., Tóth-Király, I., Davidson, O., Knox, M. C., Williamson, Z., & Costigan, A. (2021). Development and validation of the self-compassion scale for youth. Journal of Personality Assessment, 103(1), 92–105. https://doi.org/10.1080/00223891.2020.1729774CrossRefPubMed

Oberle, E., Schonert-Reichl, K. A., & Thomson, K. C. (2010). Understanding the link between social and emotional well-being and peer relations in early adolescence: Gender-specific predictors of peer acceptance. Journal of Youth and Adolescence, 39, 1330–1342. https://doi.org/10.1007/s10964-009-9486-9

Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., … Moher, D. (2021). The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ, 372, n71. https://doi.org/10.1136/bmj.n71CrossRefPubMedPubMedCentral

Pais-Ribeiro, J. L., Honrado, A., & Leal, I. (2004). Contribuição para o estudo da adaptação Portuguesa das escalas de ansiedade, depressão e stress (EADS) de 21 itens de Lovibond e Lovibond [Contribution to the adaptation study of the Portuguese adaptation of the Lovibond and Lovibond Depression Anxiety Stress Scales (EADS) with 21 items]. Psicologia, Saúde & Doenças, 5, 229–239.

Pommier E. A. (2010). The compassion scale (Doctoral dissertation). Available from https://repositories.lib.utexas.edu/handle/2152/ETD-UT-2010-12-2213. Accessed 8 Apr 2025.

Pommier, E., Neff, K. D., & Tóth-Király, I. (2020). The development and validation of the compassion scale. Assessment, 27(1), 21–39. https://doi.org/10.1177/1073191119874108

Raes, F., Pommier, E., Neff, K. D., & Van Gucht, D. (2011). Construction and factorial validation of a short form of the self-compassion scale. Clinical Psychology & Psychotherapy, 18(3), 250–255. https://doi.org/10.1002/cpp.702CrossRef

Rammstedt, B., & John, O. P. (2007). Measuring personality in one minute or less: A 10-item short version of the Big Five Inventory in English and German. Journal of Research in Personality, 41(1), 203–212. https://doi.org/10.1016/j.jrp.2006.02.001CrossRef

RefWorks. (n.d.). RefWorks [Computer software]. ProQuest. https://refworks.proquest.com/

Richter, A., Gilbert, P., & McEwan, K. (2009). Development of an early memories of warmth and safeness scale and its relationship to psychopathology. Psychology and Psychotherapy: Theory, Research and Practice, 82(2), 171–184. https://doi.org/10.1348/147608308X395213

Roeser, R. W., Colaianne, B. A., & Greenberg, M. A. (2018). Compassion and human development: Current approaches and future directions. Research in Human Development, 15(3–4), 238–251. https://doi.org/10.1080/15427609.2018.1495002CrossRef

Rosenberg, M. (1965). Society and the adolescent self-image. Princeton University Press.CrossRef

Seekis, V., Farrell, L., & Zimmer-Gembeck, M. (2023). A classroom-based pilot of a self-compassion intervention to increase wellbeing in early adolescents. Explore, 19(2), 267–270. https://doi.org/10.1016/j.explore.2022.06.003

Selman, R. L. (1980). The growth of interpersonal understanding: Developmental and clinical analyses. Academy Press.

Sinclair, S., Kondejewski, J., Jaggi, P., Rozed es Ordons, A. L., Kassam, A., Hayden, K. A., Harris, D., & Hack, T. F. (2021). What works for whom in compassion training programs offered to practicing healthcare providers: A realist review. BMC Medical Education, 21(1), 455. https://doi.org/10.1186/s12909-021-02863-w

Silva, C., & Salvador, M. C. (2010). A Escala das Formas de Autocriticismo e de Auto-Tranquilização (FSCRS): Características psicométricas na população adolescente. [Unpublished manuscript]. [The forms of Self-criticizing/Attacking and self-reassuring scale: Psychometric characteristics in adolescents’ population]. Faculty of Psychology and Educational Sciences of the University of Coimbra.

Singh, P., King-Shier, K., & Sinclair, S. (2018). The colours and contours of compassion: A systematic review of the perspectives of compassion among ethnically diverse patients and healthcare providers. PLoS ONE, 13(5), e0197261. https://doi.org/10.1371/journal.pone.0197261CrossRefPubMedPubMedCentral

Smith, B. W., Dalen, J., Wiggins, K., Tooley, E., Christopher, P., & Bernard, J. (2008). The Brief Resilience Scale: Assessing the ability to bounce back. International Journal of Behavioral Medicine, 15(3), 194–200. https://doi.org/10.1080/10705500802222972CrossRefPubMed

Song, M. (2003). Two studies on the resilience inventory (RI): Toward the goal of creating a culturally sensitive measure of adolescent resilience. Dissertation abstracts international, 64 (8B). (UMI no.3100166). Harvard: ProQuest information and learning.

Sousa, V. D., & Rojjanasrirat, W. (2010). Translation, adaptation and validation of instruments or scales for use in cross-cultural health care research: A clear and user-friendly guideline. Journal of Evaluation in Clinical Practice, 17(2), 268–274. https://doi.org/10.1111/j.1365-2753.2010.01434.xCrossRefPubMed

Sousa, R., Paulo, M., Brazão, N., Castilho, P., & Rijo, D. (2022). Measuring compassion toward others: Dimensionality of the compassion scale in community adolescents and in adolescents with behavioral disorders. Psychological Assessment, 34(7), 631–642. https://doi.org/10.1037/pas0001133CrossRefPubMed

Spielberger, C. D. (1973). Manual for the state trait anxiety inventory for children. Consulting Psychologists Press.

Spitzer, R. L., Kroenke, K., Williams, J. B. W., & Lowe, B. (2006). A brief measure for assessing generalized anxiety disorder: The GAD-7. Archives of Internal Medicine, 166(10), 1092. https://doi.org/10.1001/archinte.166.10.1092CrossRefPubMed

Stalikas, A., & Flora, A. (2012). Depression Anxiety Stress Scale (DASS). In A. Stalikas, S. Triliva, & P. Roussi (Eds.), Psychometric instruments in Greece (2nd ed., p. 247). Pedio.

Stalikas, A., & Lakioti, A. (2012). Satisfaction with Life Scale (SWLS). In A. Stalikas, S. Triliva, & P. Roussi (Eds.), Psychometric instruments in Greece (2nd ed., p. 752). Pedio.

Stolow, D., Zuroff, D. C., Young, J. F., Karlin, R. A., & Abela, J. R. (2016). A prospective examination of self-compassion as a predictor of depressive symptoms in children and adolescents. Journal of Social and Clinical Psychology, 35(1), 1–20. https://doi.org/10.1521/jscp.2016.35.1.1CrossRef

Strauss, C., Lever Taylor, B., Gu, J., Kuyken, W., Baer, R., Jones, F., & Cavanagh, K. (2016). What is compassion and how can we measure it? A review of definitions and measures. Clinical Psychology Review, 47, 15–27. https://doi.org/10.1016/j.cpr.2016.05.004CrossRefPubMed

Sutton, E., Schonert-Reichl, K. A., Wu, A. D., & Lawlor, M. S. (2018). Evaluating the reliability and validity of the self-compassion scale short form adapted for children ages 8–12. Child Indicators Research, 11(4), 1217–1236. https://doi.org/10.1007/s12187-017-9470-yCrossRef

Terwee, C. B., Bot, S. D., de Boer, M. R., van der Windt, D. A., Knol, D. L., Dekker, J., Bouter, L. M., & de Vet, H. C. (2007). Quality criteria were proposed for measurement properties of health status questionnaires. Journal of Clinical Epidemiology, 60(1), 34–42. https://doi.org/10.1016/j.jclinepi.2006.03.012CrossRefPubMed

Ustun, T. B., Kostanjesek, N, Chatterji, S, Rehm, J., & World Health Organization. (2010). Measuring health and disability: manual for WHO Disability Assessment Schedule (WHODAS 2.0). https://iris.who.int/handle/10665/43974

Wentzel, K. R. (1993). Does being good make the grade? Social behavior and academic competence in middle school. Journal of Educational Psychology, 85(2), 357–364. https://doi.org/10.1037/0022-0663.85.2.357CrossRef

Yarnell, L. M., Stafford, R. E., Neff, K. D., Reilly, E. D., Knox, M. C., & Mullarkey, M. (2015). Meta-analysis of gender differences in self-compassion. Self and Identity, 14(5), 499–520. https://doi.org/10.1080/15298868.2015.1029966CrossRef

Zessin, U., Dickhäuser, O., & Garbade, S. (2015). The Relationship Between Self-Compassion and Well-Being: A Meta-Analysis. Applied Psychology. Health and Well-Being, 7(3), 340–364. https://doi.org/10.1111/aphw.12051CrossRefPubMed

Zhou, H., Wang, Y., Ding, J., Wang, Y., & Pan, J. (2019). Development and validation of an age- appropriate Self-Compassionate Reactions Scale for Children (SCRS-C). Mindfulness, 10(11), 2439–2451. https://doi.org/10.1007/s12671-019-01220-9CrossRef

Titel: A Systematic Review of Self-report Measures Used to Assess Compassion in Children and Young People
Auteurs: Jasmine Hubbard
Tamara Leeuwerik
Clara Strauss
Publicatiedatum: 23-04-2025
Uitgeverij: Springer US
Gepubliceerd in: Mindfulness
Print ISSN: 1868-8527
Elektronisch ISSN: 1868-8535
DOI: https://doi.org/10.1007/s12671-025-02569-w

Bohn Stafleu van Loghum

Welkom bij Erasmus MC & Bohn Stafleu van Loghum

Registreer

Login

A Systematic Review of Self-report Measures Used to Assess Compassion in Children and Young People

Abstract

Objectives

Method

Results

Conclusions

Preregistration

Supplementary Information

Publisher's Note

Method

Eligibility Criteria

Search Strategy

Assessment of Quality

Results

Selected Studies

Review of Identified Measures

Compassionate Engagement and Action Scales–Adolescence (CEAS-A)

Self-Compassion Scale (SCS)

Self-Compassion Scale–Youth (SCS-Y)

Compassion Scale–Child (CS-C)

Compassion Scale–Adolescence (CS-A)

Self-Compassion Scale–Child (SCS-C)

Shortened Self-Compassion Scale–Adolescence (S-SCS-A)

Self-Compassionate Reactions Scale–Child (SCRS-C)

Discussion

Strengths Limitations and Future Research

Declarations

Conflict of Interest

Publisher's Note

Onze productaanbevelingen

BSL Psychologie Totaal

BSL Academy Accare GGZ collective

BSL GOP_opleiding GZ-psycholoog

Supplementary Information

Bohn Stafleu van Loghum

Welkom bij Erasmus MC & Bohn Stafleu van Loghum

Registreer

Login

Deel dit onderdeel of sectie (kopieer de link)

Abstract

Objectives

Method

Results

Conclusions

Preregistration

Supplementary Information

Publisher's Note

Method

Eligibility Criteria

Search Strategy

Assessment of Quality

Results

Selected Studies

Review of Identified Measures

Compassionate Engagement and Action Scales–Adolescence (CEAS-A)

Self-Compassion Scale (SCS)

Self-Compassion Scale–Youth (SCS-Y)

Compassion Scale–Child (CS-C)

Compassion Scale–Adolescence (CS-A)

Self-Compassion Scale–Child (SCS-C)

Shortened Self-Compassion Scale–Adolescence (S-SCS-A)

Self-Compassionate Reactions Scale–Child (SCRS-C)

Discussion

Strengths Limitations and Future Research

Declarations

Conflict of Interest

Publisher's Note

Deel dit onderdeel of sectie (kopieer de link)

Onze productaanbevelingen

BSL Psychologie Totaal

BSL Academy Accare GGZ collective

BSL GOP_opleiding GZ-psycholoog

Supplementary Information