Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2025 Dec 22;15:45087. doi: 10.1038/s41598-025-31882-6

Validation and psychometric analysis of the Italian version of the Behavior Identification Form (BIF) in healthy adults

Elisa Ravizzotti 1, Mirko Zitti 2, Virginia Sacchi 3, Simona Bisoffi 4, Giulia Cipriani 5, Susanna Mezzarobba 1,6, Sara Terranova 7, Alessandro Botta 2, Martina Putzolu 1,2, Gaia Bonassi 1, Carola Cosentino 1, Alessandra Finisguerra 8, Cosimo Urgesi 8,9, Laura Avanzino 2,7, Elisa Pelosin 1,2,
PMCID: PMC12749459  PMID: 41423549

Abstract

The Behavior Identification Form (BIF) was proposed by Vallacher to assess the individual level of abstraction to identify actions according to his concept of Action Identification Theory. This study aimed to validate the Italian version of BIF and assess its validity and reliability in healthy adults. The original 25-items BIF was translated into Italian and administered to 359 participants. Statistical analysis of the psychometric properties was conducted following the COSMIN guidelines for validity (Confirmatory Factor Analysis, CFA), reliability indices (internal consistency [Cronbach’s alpha]; test–retest reliability [Intraclass Coefficient Correlation]) and measurement error [Standard Error of Measurement (SEM), Minimal Detectable Change, (MDC)]). The BIF was successfully adapted into Italian. The one-dimensionality fit model was satisfied with indices of Root Mean Square Error of Approximation RMSEA = 0.03; Standardized Root Mean Square Residual SRMR = 0.05; Comparative Fit Index CFI = 0.92; Tucker-Lewis Index TLI = 0.91. The internal consistency revealed a Cronbach coefficient of 0.857 and a test–retest reliability showed an Intraclass Coefficient Correlation of 0.925. A SEM of 6.28 points and an MDC of 6.95 points were obtained. This study provided validity and reliability of the Italian version of the BIF in assessing action-identification levels in healthy adults. Future research is needed to explore its application in clinical settings.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-025-31882-6.

Keywords: Outcome assessment, Psychometrics, Construal level, Action identification, Agency

Subject terms: Cognitive neuroscience, Human behaviour, Outcomes research

Introduction

Human social interactions require the ability to engage effectively with both the environment and other individuals1. To develop this aptitude, individuals must interpret and understand situations, as well as their own and other’s actions25 Interpersonal differences in decoding actions, which involve synthesis of movement kinematics and setting functional goals, could find roots in the theory of action identification68.

This theory emphasizes the integration of motor and cognitive aspects in the formulation of action identification, based on the level of personal agency. Indeed, it has been postulated that individuals differ in their tendencies to understand and describe actions. Someone focuses on the details or mechanisms to describe the action (i.e., the “how”), concentrating on the modalities of execution (e.g., how to drive a car), while others emphasize the consequences (i.e., the “why”) focusing on the broader implications (e.g., drive a car to travel). The former perspective defines concrete, low-level characteristics of actions, resulting in overt description, while the latter relies on abstract, high-level characteristics from covert information. Each action could be described from both perspectives, and this dualism could coexist in every individual, varying dynamically across situations, with one trait tenting to emerge.

The modality by which humans could capture information from the external world is associated with different hierarchically organized levels of action abstraction. These levels are influenced by previous individual experiences and can be modulated by the ambiguity of context, beyond the difficulty of execution6. Therefore, an abstract level of action identification might enhance performance, which can be captured as an automated gesture through full knowledge and awareness of the action itself9, and is linked to interoceptive experience10. In contrast, when the experience is poor, more details are required to identify an action, resulting in a lower concrete level of detection, relying on sensorimotor experience11.

Although the first formulation of the Action Identification Theory is rather outdated, its concept has evolved over the years. In particular, it has been integrated with the concept of levels of abstraction12, and is now a relevant issue in both social and personal contexts, offering valuable insight into diverse aspects of human behavior13,14. Current evidence suggests that, in our complex model of society, feelings of autonomy influence the level of action identification. When individuals feel empowered and autonomous, they tend to attribute more abstract meaning to their activities, acting in a purpose-oriented manner that promote greater efficiency. Conversely, reduced autonomy can lead to more concrete action identification, diminishing the sense of responsibility and affecting motivation, self-regulation, and well-being12.

Furthermore, individual levels of action identification can be altered by some psychological disorders. For example, individual with depression or anxiety tend to assign a more abstract identifications to negative than positive ones, with impacts on their emotional responses, leading to repetitive thought patterns, and hinders problem-solving abilities15. Evidence also suggests that chronic pain may influence the level of action abstraction. Persistent symptoms could interfere with daily activities, leading individuals to perceive their actions at lower, less meaningful levels, thereby affecting their global perception of life16.

Starting from the Action Identification Theory, Vallacher developed the Behavior Identification Form (BIF)7 as an instrument to measure the personal construct of action identification. This outcome measure consists of 25 items that propose fairly common activities to which meaning can be attributed by choosing an alternative within a dual response option, one describing an interpretation of actions based on motor aspects (low-level) and the other based on more cognitive aspects (high-level). The subjects’ level of action identification is defined by the number of high-level choices. The overall score, from 0 to 25, is used as an indicator of the level of action identification, associating higher scores with preferences for higher-level action identifications and, consequently, higher-level construal mindset17.

The use of BIF to assess principles of Action Identification Theory, in which mental representations and action are strictly related to each other, assumes relevance to deepen the dynamics of human emotional-motor-cognitive behaviors immersed in different social contexts9.

Indeed, this measure may allow assessing the individual’s tendency to read the reality, as in a Turkish study in which BIF measured the construal level18 defined as the psychologically distance towards an action. Currently, there is no validated Italian version of the BIF. Introducing this instrument into the Italian context would address an existing gap by providing a culturally appropriate tool. Once its psychometric properties are established, the Italian version of the BIF could be used in both research and clinical settings involving Italian populations.

On these bases, the main aim of this study was to validate the Italian version of the BIF and assess its psychometric proprieties, in the domains of validity and reliability, in a sample of healthy adults. Secondary goal was to also explore the technical characteristics of this instrument.

Material and methods

This cross-sectional study was developed from the collaboration between the University of Genoa, the University of Udine and the IRCCS E. Medea. The first stage of the project was dedicated to the translation in Italian language of the BIF, the second one to the validation process and analysis of measurement properties of the instrument.

The second stage of this study was designed following the Consensus-based Standards for the selection of health Measurement Instruments (COSMIN) guidelines and the full reporting adhered to its format19. The study was conducted in accordance with legal and ethical international standards outlined in the updated Declaration of Helsinki20 after requesting approval from the local Committee of the University of Genoa (CERA, Unige, approval 2024/80) and obtaining written informed consent from all participants who confirmed their agreement by checking the appropriate box on the first page of the Google Form online material before proceeding with the BIF items.

Instrument and translation process

The BIF is a self-administered questionnaire consisting in 25 items investigating the level of identification of familiar, common actions according to the Action Identification Theory. Participants were asked to choose between one of two alternative descriptors, classified as low or high level, of a defined action. High level responses contributed to generate the total score of the questionnaire, ranging from 0 to 25, and defining the level of action identification characterizing each subject.

Italian translation was conducted following the “Translation and Cultural Adaptation of Patient Reported Outcomes Measures-Principles of Good Practice” guidelines. The translation process included three steps: (1) two native English translators who were familiar with the examined topics have independently translated the questionnaire into Italian (forward translation); (2) one native Italian translator, aware of the subject matter, chose the best translation in order to create the final Italian version of the tool; and (3) a bilingual person with a certificated knowledge of the English language translated the text back into English (backward translation). The forward and backward translations were reviewed by qualified Italian and English-speaking clinicians before agreement on the final version. Final step consisted of having the approval of the Italian version by the main authors of the instrument.

Participants

Participants were recruited through a convenience sampling method, primarily via social media (e.g., WhatsApp groups, messaging function on social networks) and e-mails contacts, to facilitate broad and efficient data collection, from individuals across different regions of Italy. This procedure21,22 appeared appropriate, given the primary aim of the study. Participants were considered eligible based on the following inclusion criteria: (i) age over 18 years old, (ii) native Italian speaker or individual proficient in understanding the instructions and the questionnaire in Italian, (iii) access to an email address and internet, and (iv) self-declared absence of neurological or psychiatric disorders, confirmed through signed informed consent23. No gender-based restrictions were applied. Description of the sample structure and demographic characteristics is provided in the results section.

Procedures

The Italian version of the BIF questionnaire (Supplementary Information, S1) was administered using Google Forms. The link to the questionnaire was distributed via digital channels by project coordinators and collaborators, between the 19th September and the 20th October 2024. Responders were required to provide an email address, before starting to complete the BIF. This was necessary to send a follow-up link inviting participants to complete the questionnaire again, two weeks later, allowing for the assessment of test–retest reliability. Recruitment was conducted solely by the University of Genoa. The University of Udine and IRCCS E. Medea contributed to translation, data analysis, and manuscript writing.

Statistical analysis

Based on the literature, a sample of over 300 participants is considered good24 or adequate25 for validating measurement scales. Additionally, the COSMIN guidelines recommend a minimum of 7 participants per item-scale and at least 100 subjects for structural validity studies26. Finally, for assessing test–retest reliability and measurement error analysis, a sample size of 50 to 99 participants is considered satisfactory27.

Demographic characteristics of the sample were summarized using descriptive statistics, with mean and standard deviations for the continuous variables (age and education), and absolute frequencies with percentage for the dichotomous variables (gender).

Descriptive statistics for the total and for each BIF item were reported as frequency with percentage, whereas its psychometric properties were investigated specifically in the domains of validity and reliability as required by the COSMIN recommendation26.

Validity

The structural validity, defined as the degree to which an instrument’s scores accurately represent the dimensionality of the concept to be examined26, was analyzed to verify the one-dimensionality of the 25-items of the questionnaire through a Confirmatory Factorial Analysis (CFA) using the Weighted Least Squares Mean and Variance Adjusted (WLSMV) estimator, appropriate for the dichotomous survey items, within the Classical Test Theory (CTT). The analysis considered both absolute and relative indices related to a good fit model based on scaled statistics: chi-square test of model fit (χ2); Root Mean Square Error of Approximation (RMSEA) ≤ 0.06 for an excellent model fit, between 0.06 and 0.08 for an acceptable fit, with values up to 0.10 considered adequate28; Standardized Root Mean Square Residual (SRMR) deemed acceptable if ≤ 0.08; Comparative Fit Index (CFI) and Tucker-Lewis Index (TLI), both considered excellent for values > 0.95, or acceptable for the range between 0.90 and 0.9529,30. In addition, standardized factor loadings (β) for each item were reported to assess the adequacy of each indicator in representing the latent construct. According to literature, a standardized loading of 0.40 or higher is considered acceptable, indicating that the item contributes substantially to the measurement of the underlying factor. To control for the influence of multivariate outliers, we computed the Mahalanobis Distance (MD) for each participant31,32 MD evaluate how far an individual’s response pattern deviates from the multivariate center of the sample distribution. A critical value of χ225, 0.001 = 52.619 was used, corresponding to 25 degrees of freedom (equal to the number of BIF items) at a significance level of p < 0.001. Observations exceeding this threshold were to be flagged as multivariate outliers.

Reliability

The reliability was assessed in terms of internal consistency to verify the homogeneity and the interrelatedness of the all 25 items of the scale. Both Cronbach’s alpha (α) and Donald’s omega (ω) coefficients, the last more accurate for binary items, were used as the reference index for the one-dimensionality. Both coefficients used to be considered acceptable with values between 0.70 and 0.79, good in the range between 0.80 and 0.89, and excellent with values > 0.903335. Values above 0.950 might indicate redundancy within items. Additionally, if an item was deleted, a lower value of the α or ω coefficient, compared to the total alpha, or to the total omega respectively, was expected36. In addition, at the item-level analysis, Inter-Item and Item-to-Total correlations (i.e., IIC and ITC) were assessed by the Spearman’s rho correlation coefficient according to the psychometric validation procedures for self-report instruments with dichotomous items37. The IIC assess if items are positively related in directionally expected ways, the ITC check if each item contributes to the overall construct measured by the scale.

Average IIC was considered acceptable for values ranging from 0.15 to 0.5025, while ITC was considered acceptable for ≥ 0.25, and excellent for coefficients > 0.3038.

Test–retest reliability was checked by administering the questionnaire twice, approximately 2 weeks apart. The Intraclass Correlation Coefficient (ICC) using a 2-way random effects model with a 95% confidence interval (CI) and an absolute effect was used to evaluate the stability of the BIF results as proper methods to account for both agreement and consistency between repeated measures. For group measurements, ICC values were classified as poor if below 0.50, moderate between 0.50 and 0.75, good between 0.75–0.90 and excellent above 0.9039,40.

Lastly, the degree of uncertainty associated with the measurement error of two assessments was further explored by calculating the Standardized Error of Measurement (SEM) and the Minimal Detectable Change (MDC)41. The SEM was calculated using the following formula: SD xInline graphic where SD represents the standard deviation of the baseline data and ICC the value obtained from the test–retest reliability and the MDC Inline graphic which corresponds to the z-score equivalent to a 95% confidence level34,42.

BIF technical features, intended as feasibility and acceptability, were evaluated considering the ease of online dissemination and the average completion time by participants.

Data collected on the Forms platform were exported and organized in Excel files. Statistical analyses were performed using Jasp [v. 0.18.1]43 and Jamovi [v. 2.26]44 softwares.

Despite the brevity of the questionnaire and the voluntary nature of participation, we adopted both preventive and post-hoc strategies to mitigate the risk of careless responding, as recommended by Ward and Meade45. To reduce inattentive responding, we provided clear and standardized instructions in both the informed consent and on-screen prior to survey initiation. Completion of all items was mandatory to allow submission of the questionnaire. Additionally, we monitored completion times and excluded responses that were excessively fast or delayed, which could indicate insufficient engagement.

As part of post hoc quality control, we screened the dataset for response patterns indicative of low-effort behavior (e.g., consistently selecting the same binary option across all items). Furthermore, multivariate outlier analysis using Mahalanobis distance was conducted to identify atypical response profiles. These checks were performed prior to the psychometric analysis of the BIF.

Results

The process to define the pooled sample analyzed in this study is illustrated in Fig. 1. A total of 359 volunteers responded to the first administration of the questionnaire, and 232 responded to the second one. Participants had a mean age of 36.73 years (SD = 14.12), 35% (125 participants) of the sample were men, and the average level of education was 15.81 years (SD = 3.51).

Fig. 1.

Fig. 1

Flow chart of the study. The figure summarizes the steps of the study: in the first stage of the validation process, a link to access to the questionnaire was sent by mail to participants (359 questionnaires obtained); in the last stage, a second link to the questionnaire was sent to analyze the test–retest reliability (232 questionnaires obtained).

The mean total score, and the descriptive statistics for each item of the BIF are described in Table 1.

Table 1.

Descriptive statistics of the Italian version of the Behavior Identification Form (BIF).

Items Test Re-test
N (%) N (%) N (%) N (%)
1 263 (73) 96 (27) 179 (77) 53 (23)
2 75 (21) 284 (79) 47 (20) 185 (80)
3 198 (55) 161 (45) 118 (51) 114 (49)
4 163 (45) 196 (55) 112 (48) 120 (52)
5 138 (38) 221 (62) 88 (38) 144 (62)
6 84 (23) 275 (77) 171 (74) 61 (26)
7 169 (47) 190 (53) 101 (44) 131 (56)
8 288 (80) 71 (20) 182 (78) 50 (22)
9 114 (32) 245 (68) 58 (25) 174 (75)
10 214 (60) 145 (40) 142 (61) 90 (39)
11 228 (64) 131 (36) 147 (63) 85 (37)
12 85 (24) 274 (76) 57 (25) 175 (75)
13 297 (83) 62 (17) 179 (77) 53 (23)
14 148 (41) 211 (59) 82 (35) 150 (65)
15 151 (42) 208 (58) 107 (46) 125 (54)
16 298 (83) 61 (17) 188 (81) 44 (19)
17 175 (49) 184 (51) 122 (53) 110 (47)
18 115 (68) 244 (68) 80 (34) 152 (66)
19 119 (67) 240 (33) 86 (37) 146 (63)
20 332 (92) 27 (8) 210 (91) 22 (9)
21 157 (56) 202 (56) 111 (48) 121 (52)
22 231 (64) 128 (36) 70 (30) 162 (70)
23 180 (50) 179 (50) 114 (49) 118 (51)
24 179 (50) 180 (50) 110 (47) 122 (53)
25 221 (62) 138 (38) 136 (59) 96 (41)

The table showed the frequencies and percentage values of high- and low- level responses given by the entire sample of participants (359 test, 232 re-test) for each of the 25 items. In bold were indicated the score values of the high-level responses according to the original BIF form. N, numerosity.

Both genders showed a general tendency to interpret action with a high-level perspective, with 98 (78%) men and 154 (66%) women in particular choosing the more abstract option.

Structural validity analysis showed proper fit indices results to one-factor model from the CFA estimated with WLSMV. The one-dimensional model was expressed as χ2 = 334, df = 275; RMSEA = 0.025, 90% CI: 0.013—0.033; SRMR = 0.075; CFI = 0.977; and TLI = 0.975. The CFA revealed that item factor loadings ranged from 0.60 to 1.59 for unstandardized values and over 0.423 for standardized ones, with exception of items 19 (β = 0.39) and 22 (β = 0.28) that fall just below the cut-off. (Table 2). No multivariate outliers were identified based on the MD analysis, as no participant exceeded the critical value of χ2(25, 0.001) = 52.619. A remarkable good internal consistency (α = 0.857; ω = 0.858) was shown for the BIF (Table 2).

Table 2.

Standardized (β) and unstandardized factor loadings (Estimate) for each item on the latent factor (F1) from the confirmatory factor analysis (CFA) estimated with WLSMV.

Latent Observed Estimate SE 95% CI Lower 95% CI Upper β z p
F1 item1 1.000 0.000 1.000 1.000 0.469
item2 1.221 0.198 0.834 1.608 0.573 6.18  < .001
item3 1.160 0.189 0.791 1.530 0.544 6.15  < .001
item4 1.052 0.186 0.689 1.416 0.494 5.67  < .001
item5 1.374 0.206 0.970 1.778 0.644 6.66  < .001
item6 0.987 0.196 0.603 1.371 0.463 5.04  < .001
item7 1.329 0.203 0.930 1.727 0.623 6.54  < .001
item8 1.287 0.205 0.884 1.689 0.603 6.27  < .001
item9 1.273 0.203 0.875 1.671 0.597 6.27  < .001
item10 1.374 0.210 0.962 1.786 0.645 6.54  < .001
item11 1.430 0.220 1.000 1.861 0.671 6.51  < .001
item12 1.492 0.223 1.055 1.929 0.700 6.69  < .001
item13 0.940 0.196 0.556 1.324 0.441 4.80  < .001
item14 1.422 0.220 0.991 1.853 0.667 6.47  < .001
item15 1.333 0.199 0.942 1.724 0.625 6.68  < .001
item16 1.593 0.232 1.138 2.048 0.747 6.87  < .001
item17 1.435 0.220 1.004 1.865 0.673 6.53  < .001
item18 1.497 0.224 1.058 1.936 0.702 6.69  < .001
item19 0.838 0.160 0.524 1.151 0.393 5.24  < .001
item20 1.199 0.254 0.700 1.698 0.562 4.71  < .001
item21 1.126 0.187 0.760 1.492 0.528 6.03  < .001
item22 0.600 0.155 0.297 0.904 0.282 3.88  < .001
item23 1.355 0.217 0.929 1.781 0.635 6.23  < .001
item24 1.160 0.194 0.780 1.539 0.544 5.98  < .001
item25 1.373 0.218 0.945 1.801 0.644 6.28  < .001

All standardized loadings represent the strength of association between each observed item and the latent factor. F1, Latent factor; SE, Standard error; CI, confidence interval; β, Standardized factor loading.

The average of inter-item correlation was 0.192. From the analysis of the item-to-total correlations emerged coefficients higher than 0.25 for all items, except for item #22 (ITC = 0.197). Cronbach’s alpha-if-item-deleted was solid, as suggested that the deletion of almost all items did not increase α, ranged from 0.848 to 0.855, apart from item #22 (α = 0.858 or ω = 0.859 if item deleted). Similarly, values of Donald’s omega confirmed the result with other values between 0.849 to 0.856 (Table 3).

Table 3.

Item descriptive statistics, and internal consistency with Cronbach’s α and Donald’s ω values of the Italian version of the BIF.

Item Mean % (± SD) Item-to-total correlation Cronbach’s alpha (α ) if item deleted Donald’s omega (ω) if item deleted
1 73 (± 4.43) 0.326 0.854 0.855
2 79 (± 4.07) 0.386 0.852 0.853
3 55 (± 4.98) 0.399 0.852 0.853
4 45 (± 4.99) 0.358 0.853 0.854
5 38 (± 4.87) 0.460 0.850 0.851
6 77 (± 4.24) 0.307 0.854 0.856
7 47 (± 5.00) 0.456 0.850 0.851
8 80 (± 3.99) 0.395 0.852 0.853
9 68 (± 4.66) 0.421 0.851 0.852
10 60 (± 4.91) 0.472 0.849 0.850
11 36 (± 4.82) 0.475 0.849 0.851
12 76 (± 4.26) 0.482 0.849 0.850
13 83 (± 3.79) 0.277 0.855 0.856
14 41 (± 4.93) 0.484 0.849 0.850
15 58 (± 4.94) 0.464 0.849 0.851
16 83 (± 3.76) 0.481 0.850 0.850
17 51 (± 5.01) 0.495 0.848 0.850
18 68 (± 4.67) 0.512 0.848 0.849
19 67 (± 4.71) 0.285 0.855 0.856
20 92 (± 2.64) 0.282 0.855 0.856
21 56 (± 4.97) 0.390 0.852 0.853
22 36 (± 4.80) 0.197 0.858 0.859
23 50 (± 5.01) 0.472 0.849 0.851
24 50 (± 5.01) 0.400 0.852 0.853
25 38 (± 4.87) 0.458 0.850 0.851
Recommended values  ≥ 0.250  < α  < ω

SD, Standard deviation. Statistics values outside the recommended cut-offs are in bold.

Test–retest reliability was calculated on 232 subjects. Results showed an excellent reliability with ICC(2,1) = 0.925 95% CI = 0.903–0.942), and two measurement error indices with a SEM of 6.28 points and MDC of 6.95 points.

The Italian version of BIF was completed by the recruited participants in an average time of 5 min and 39 s. Only one missing questionnaire were found among the data extracted. No indications of careless or inattentive responding were found in the remaining data. No problems of comprehension were signaled, nor troubles with the digital platform.

Discussion

This study had the goal to validate the Italian version of the BIF and test its validity and reliability properties, representing the first investigation into the classic psychometric properties of BIF in Italian healthy adults. The Italian BIF revealed a good one-dimensionality fit for the structure validity, a good internal consistency, and an excellent test–retest reliability. From our study it could be infer that BIF could be used systematically to monitoring trends towards high or low levels of identification behaviors in Italian healthy adults.

Previous research reported the use of translated versions of the BIF in other languages, as Dutch46, Turkish18, Japanese 47, and Chinese48 but translated versions were not find into citing papers and no process of validation of the instrument in these languages was reported. Only a French translation, administered online to a student sample, was found. In this case participants preferred higher level responses and BIF revealed a good internal consistency value (α = 0.79)49. Furthermore, evidence reported an extensive validation process, with focus on validity and reliability properties, only in Polish50.

Collecting 359 corrected responses of participants at the first administration of BIF and 232 in the second phase, the achievement of desired sample size according to the recommendation of the scientific literature was obtained.

Structural validity data analysis confirmed a good unidimensional fit model regarding the Action Identification Theory, aligning with the expected theoretical constructs, as for the original and the Polish versions of the BIF. The value of RMSEA and SRMR resulted appropriated, below the recommended thresholds; in particular RMSEA showed a narrow confidence interval indicating precise estimation and excellent absolute fit. In addition TLI and CFI reached an excellent threshold. In the Polish study, the author also investigated the structural validity of the questionnaire through an Exploratory Factor Analysis that converged to a one factor structure. These considerations supported our decision to retain the original structure without post hoc re-specification. CFA results were not affected by multivariate outliers, as none were identified using MD analysis. This suggests that within our healthy adult sample, no response patterns significantly deviated from the multivariate distribution, supporting the robustness of the model estimation. Reliability analyses reported a good internal consistency with a total α value of 0.857, reinforced by the robust value of the 0.858-ω coefficient, confirming the unidimensional structure of the BIF. Value of α coefficient appeared in line with the ones reported in the original study (α = 0.84) and in Polish version (α = 0.85). The IIC (0.192) fell within the acceptable range of values indicating that items were well correlated and measuring the same construct without be redundant.

Despite the result of the item #22 below the level of acceptability, the ITC value of 0.41 (cut-off > 0.200) was good and greater than values (0.35) reported both by the original and the Polish study. The Cronbach’s α and the Donald’s ω, if an item was deleted, was lower than the overall value suggesting that all items contribute to the generation of the instrument-score. Both the α and the ω coefficients of item #22 (Travelling a car) were an exception, with values of 0.58 and 0.59 respectively, indicating an insignificant increment of 0.01, lower than critical increment of 0.1. A similar pattern with the unique increment of the Cronbach’s α in the ITC analysis was found in the Polish study50. Although Item19 (β = 0.393) and Item22 (β = 0.255) fall just below the conventional threshold of 0.40 for standardized factor loadings51,52, the overall model fit indices and internal consistency metrics (e.g., Cronbach’s alpha and McDonald’s omega) were within acceptable to good ranges. This suggests that, although some items are weaker indicators, the scale as a whole reliably measures the intended construct. These findings are consistent with previous research on complex psychological constructs52 and highlight the need to evaluate both item-level and scale-level performance when validating instruments.

Regarding the test–retest analysis, the Italian version of BIF revealed excellent reliability with a ICC(2,1) (0.925) slightly higher than that of the Vallacher’s study (0.91). In addition, we investigated two other parameters associated with reliability not explored in previous studies, such as measurement error, finding a relative low SEM (6.28) below the standard deviation value (22.92), and MDC (6.95) over an interval of about 2 weeks. This time period was set at the duration used in the original study, but some participants took up to 15 days longer to give second responses. However, we considered that the construct was not spontaneously alterable in that time frame. The strong psychometric properties demonstrated by the Italian version of BIF outlined the potential practical utility in both research and applied settings of this outcome instrument.

Finally, the limited resources required, in terms of cost and time, to complete BIF made the instrument feasible in a straightforward manner and in particular the possibility of the online administration suggests the possibility of remote and repeated assessments and new development in digital health research also in large-scale and with longitudinal design.

Limitations and further research

There are some limitations in this study that are worth discussing. First, participants were enrolled through a convenience sampling method, which is more exposed, than random one, to the risk of selection bias, because the sample might be characterized by some aspects that are not typical of the larger population. For example, the relatively low average age of the sample might have hidden some age-related effects, considering that changes in the sensorimotor system might exert some influences on the choice of action identification. In addition, the web-based dissemination of access to the questionnaire, might have influenced the selection of participants, considering that the elderly might still be unfamiliar with technology. We could have remedied these aspects enrolling a larger number of participants than our cohort, which, however, satisfied abundantly the recommendations of the COSMIN for psychometric analyses. Another issue is the small number of demographic variables collected, which did not allow for a better detailed characterization and description of the sample. For these reasons, future research should consider employing larger and more representative samples to confirm and to extend the psychometric properties of the BIF across diverse populations, adding more variables (e.g. income, area of residence) to facilitate subgroup analyses, enhance external validity and the robustness of conclusions. Finally, another limitation was the reliance on self-reported health status without formal clinical screening. Nevertheless, this approach aligns with common practice in large-scale psychometric validation research and represents a reasonable compromise between methodological standards and feasibility. Our study lacked criterion validity analysis, which was elaborated in the Polish study comparing other measures relevant to the personal agency construct. Considering that the assessment was administered online, we chose to capture only one outcome measure to avoid the risk of excessive drops out or missing data.

In addition to a more systematic investigation in elderly, future studies should consider validating the BIF questionnaire and analyzing its responsiveness in clinical settings. As suggested by Watkins15, a defective action identification mechanism in terms of high- or low- level description can be detected in a transdiagnostic way in a wide range of conditions including autism53. Even if, to the best of our knowledge, there are no studies testing this issue, Marsh and colleagues54 assessed the relationship between the score at the Autism Quotient55 and the score at BIF in a sample of healthy adults. Their findings showed an inverse relation between autistic traits and the proportion of high-level responses in the BIF, thus supporting the importance of investigating the action identification mechanisms also in clinical samples with autism. Furthermore, it should be investigated also patients with Parkinson’s Disease in which the discrepancies between different levels of action identification and personal agency seem to be relevant.

The impact of the construct of personal agency in individual assessment justifies the use of the BIF as a valid and reliable outcome measure in healthy Italian adults.

Conclusion

The Italian version of the BIF proved to be a valid and reliable tool applied on healthy adults to assess own level of abstractness in action identification. Our results were in line with the good psychometric properties of the original scale7 and with the Polish50 version. By adhering to COSMIN guidelines and employing comprehensive psychometric analysis, including CFA, internal consistency, test–retest reliability, and measurement error indices, we have enhanced the methodological rigor compared to earlier studies.

Further studies are required to evaluate the introduction of this instrument of measure in clinical practice (e.g., in Parkinson’s Disease and Autism Spectrum Disorders) to assess some relevant characteristics as embodiment and sense of agency.

Supplementary Information

Acknowledgements

The authors thank the participants and institutions for their involvement in responding to the survey. LA, EP, MP are supported by #NEXTGENERATIONEU (NGEU) and funded by the Ministry of University and Research (MUR), National Recovery and Resilience Plan (NRRP), project MNESYS (PE0000006)—A Multiscale integrated approach to the study of the nervous system in health and disease (DN. 1553 11.10.2022). GB is supported by “RAISE—Robotics and AI for Socio-economic Empowerment” and has been supported by European Union—NextGenerationEU, and it was partially funded by the European Union—NextGenerationEU. However, the views and opinions expressed are those of the authors alone and do not necessarily reflect those of the European Union or the European Commission. Neither the European Union nor the European Commission can be held responsible for them. AF and CU were supported by grants from the Italian Ministry of Health (Ricerca Corrente2024, Scientific Institute, IRCCS E. Medea, to A.F.). This study was also partially supported by a grants Fondi per la Ricerca Corrente (2022–2024) from the Italian Ministry of Health to IRCCS Ospedale Policlinico San Martino.

Author contributions

Study conception and design of the work: E.R., E.P., A.F., C.U. Acquisition of data: E.R., M.Z, V.S., S.B., G.C., C.C., S.T., A.B., G.B. Analysis and interpretation of data: E.R., E.P., M.Z, V.S., S.B., G.C. Draft of the manuscript: E.R., E.P., L.A. Critical revision of the manuscript: E.P., L.A., A.F., C.U., S.M, M.P.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

The original online version of this Article was revised: The original version of this Article contained an error in the name of author Elisa Pelosin, which was incorrectly given as Pelosin Elisa.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Change history

2/26/2026

A Correction to this paper has been published: 10.1038/s41598-026-42148-0

References

  • 1.Kelly, M. P., Kriznik, N. M., Kinmonth, A. L. & Fletcher, P. C. The brain, self and society: A social-neuroscience model of predictive processing. Soc. Neurosci.14, 266–276 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kroczek, L. O. H., Lingnau, A., Schwind, V., Wolff, C. & Mühlberger, A. Observers predict actions from facial emotional expressions during real-time social interactions. Behav. Brain Res.471, 115126 (2024). [DOI] [PubMed] [Google Scholar]
  • 3.Van Der Weiden, A., Porcu, E. & Liepelt, R. Action prediction modulates self–other integration in joint action. Psychol. Res.87, 537–552 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Yovel, G. & O’Toole, A. J. Recognizing people in motion. Trends Cogn. Sci.20, 383–395 (2016). [DOI] [PubMed] [Google Scholar]
  • 5.Bolt, N. K., Poncelet, E. M., Schultz, B. G. & Loehr, J. D. Mutual coordination strengthens the sense of joint agency in cooperative joint action. Conscious. Cogn.46, 173–187 (2016). [DOI] [PubMed] [Google Scholar]
  • 6.Vallacher, R. R. & Wegner, D. M. What do people think they’re doing? Action identification and human behavior. Psychol. Rev.94, 3–15 (1987). [Google Scholar]
  • 7.Vallacher, R. R. Levels of personal agency: Individual variation in action identification. J. Personal. Soc. Psychol..57(4), 660-671 (1989). [Google Scholar]
  • 8.Wegner, D. M., Vallacher, R. R., Kiersted, G. W. & Dizadji, D. Action identification in the emergence of social behavior. Soc. Cogn.4, 18–38 (1986). [Google Scholar]
  • 9.Vallacher, R. R. & Wegner, D. M. Action Identification Theory. In Handbook of Theories of Social Psychology. 1, 327–348 (SAGE Publications Ltd, United Kingdom, 2012).
  • 10.Connell, L., Lynott, D. & Banks, B. Interoception: the forgotten modality in perceptual grounding of abstract and concrete concepts. Phil. Trans. R. Soc. B373, 20170143 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Villani, C., Lugli, L., Liuzza, M. T., Nicoletti, R. & Borghi, A. M. Sensorimotor and interoceptive dimensions in concrete and abstract concepts. J. Mem. Lang.116, 104173 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cancino-Montecinos, S., Björklund, F. & Lindholm, T. Dissonance and abstraction: Cognitive conflict leads to higher level of construal. Euro. J. Soc. Psych.48, 100–107 (2018). [Google Scholar]
  • 13.Fini, C., Era, V., Rold, F. D., Candidi, M. & Borghi, A. M. Abstract concepts in interaction: the need of others when guessing abstract concepts smooths dyadic motor interactions. Royal Soc. Open Sci.8, 201205 (2021). [Google Scholar]
  • 14.Nico, D., Borghi A, M., Tummolini, L. & Daprati, E. Abstract concepts and simulated competition. Psychol. Res.88, 238–256 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Watkins, E. Dysregulation in level of goal and action identification across psychological disorders. Clin. Psychol. Rev.31, 260–278 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Richardson, H. & Morley, S. Action identification and meaning in life in chronic pain. Scand. J. Pain9, 1–10 (2015). [DOI] [PubMed] [Google Scholar]
  • 17.Fujita, K., Henderson, M. D., Eng, J., Trope, Y. & Liberman, N. Spatial distance and mental construal of social events. Psychol. Sci.17, 278–282 (2006). [DOI] [PubMed] [Google Scholar]
  • 18.Yetiser, B. Do All Roads Lead to Rome? The Moderating Role of Culture and Age in Predicting Construal Level on Machiavellianism/Bütün Yollar Roma’ya Mı Çıkar? Makyavelizmde Yorumlama Seviyesinin Belirlenmesinde Kültür ve Çağın Aracı Rolü. Journal of Yaşar University9, (2015).
  • 19.Gagnier, J. J., Lai, J., Mokkink, L. B. & Terwee, C. B. COSMIN reporting guideline for studies on measurement properties of patient-reported outcome measures. Qual. Life Res.30, 2197–2218 (2021). [DOI] [PubMed] [Google Scholar]
  • 20.World Medical Association Declaration of Helsinki. Ethical principles for medical research involving human subjects. JAMA310, 2191 (2013). [DOI] [PubMed] [Google Scholar]
  • 21.Etikan, I. Comparison of convenience sampling and purposive sampling. AJTAS5, 1 (2016). [Google Scholar]
  • 22.Topolovec-Vranic, J. & Natarajan, K. The use of social media in recruitment for medical research studies: A scoping review. J. Med. Internet Res.18, e286 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Boateng, G. O., Neilands, T. B., Frongillo, E. A., Melgar-Quiñonez, H. R. & Young, S. L. Best practices for developing and validating scales for health, social, and behavioral research: A primer. Front. Public Health6, 149 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Comrey, A. L. & Lee, H. B. A First Course in Factor Analysis (L. Erlbaum Associates, Hillsdale, 1992). [Google Scholar]
  • 25.Clark, A. Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav. Brain Sci.36, 181–204 (2013). [DOI] [PubMed] [Google Scholar]
  • 26.Mokkink, L. B. et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: An international Delphi study. Qual. Life Res.19, 539–549 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Mokkink, L. B. et al. COSMIN risk of bias checklist for systematic reviews of patient-reported outcome measures. Qual. Life Res.27, 1171–1179 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Kenny, D. A., Kaniskan, B. & McCoach, D. B. The performance of RMSEA in models with small degrees of freedom. Soc. Methods Res.44, 486–507 (2015). [Google Scholar]
  • 29.Hu, L. & Bentler, P. M. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct. Equ. Modeling6, 1–55 (1999). [Google Scholar]
  • 30.Kang, H. & Ahn, J.-W. Model setting and interpretation of results in research using structural equation modeling: A checklist with guiding questions for reporting. Asian Nurs. Res.15, 157–162 (2021). [Google Scholar]
  • 31.De Maesschalck, R., Jouan-Rimbaud, D. & Massart, D. L. The mahalanobis distance. Chemom. Intell. Lab. Syst.50, 1–18 (2000). [Google Scholar]
  • 32.Todeschini, R., Ballabio, D., Consonni, V., Sahigara, F. & Filzmoser, P. Locally centred mahalanobis distance: A new distance measure with salient features towards outlier detection. Anal. Chim. Acta787, 1–9 (2013). [DOI] [PubMed] [Google Scholar]
  • 33.Bland, J. M. & Altman, D. G. Cronbach’s alpha. BMJ Br. Med. J.314, 572 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Terwee, C. B. et al. Quality criteria were proposed for measurement properties of health status questionnaires. J. Clin. Epidemiol.60, 34–42 (2007). [DOI] [PubMed] [Google Scholar]
  • 35.Streiner, D. L., Norman, G. R. & Cairney, J. Health Measurement Scales: A Practical Guide to Their Development and Use (Oxford University Press, 2015). [Google Scholar]
  • 36.Basagni, B. et al. The unidimensionality of the five brain injury rehabilitation trust personality questionnaires (BIRT-PQs) may be improved: Preliminary evidence from classical psychometrics. Brain Inj.34, 673–684 (2020). [DOI] [PubMed] [Google Scholar]
  • 37.Schober, P., Boer, C. & Schwarte, L. A. Correlation coefficients: Appropriate use and interpretation. Anesth. Analg.126, 1763–1768 (2018). [DOI] [PubMed] [Google Scholar]
  • 38.DeVellis, R. Classical Test Theory. Medical Care44, S50–59 (2006)
  • 39.Post, M. W. What to do with “moderate” reliability and validity coefficients?. Arch. Phys. Med. Rehabil.97, 1051–1052 (2016). [DOI] [PubMed] [Google Scholar]
  • 40.Koo, T. K. & Li, M. Y. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J. Chiropr. Med.15, 155–163 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Weir, J. P. Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM. J. Strength Cond. Res.19, 231 (2005). [DOI] [PubMed] [Google Scholar]
  • 42.Foundations of Clinical Research. Applications to practice. Surv. Ophthalmol.47, 598 (2002). [Google Scholar]
  • 43.JASP Team (2024). JASP (Version 0.19.3)[Computer software].
  • 44.The Jamovi Project. jamovi (Version 2.6). [Computer Software]. Retrieved from https://www.amovi.org (2025).
  • 45.Ward, M. K. & Meade, A. W. Dealing with careless responding in survey data: Prevention, identification, and recommended best practices. Annu. Rev. Psychol.74, 577–596 (2023). [DOI] [PubMed] [Google Scholar]
  • 46.Smith, P. K. & Trope, Y. You focus on the forest when you’re in charge of the trees: Power priming and abstract information processing. J. Pers. Soc. Psychol.90, 578–596 (2006). [DOI] [PubMed] [Google Scholar]
  • 47.Nakamura, J. et al. Effects of galvanic vestibular stimulation combined with physical therapy on pusher behavior in stroke patients: A case series. NRE35, 31–37 (2014). [Google Scholar]
  • 48.Gu, X. & Tse, C. Abstractness and desirableness in the human values system: Self-transcendence values are construed more abstractly, but felt more closely than are self-enhancement values. Asian J. Soc. Psychol.21, 282–294 (2018). [Google Scholar]
  • 49.Brown, D. N. Agency and motivation to achieve language-learning objectives among learners in an academic environment in France. Apples J. Appl. Lang. Stud.8, 101-126 (2014). [Google Scholar]
  • 50.Pilarska, A. Assessing individual variation in action identification: Translation, validation, and shortening of the polish behavior identification form (BIF). Psychol. Rep.123, 1470–1492 (2020). [DOI] [PubMed] [Google Scholar]
  • 51.Brown, T. A. Confirmatory Factor Analysis for Applied Research (The Guilford Press, 2015). [Google Scholar]
  • 52.Kline, R. B. Principles and Practice of Structural Equation Modeling (Guilford Press, 2011). [Google Scholar]
  • 53.Belayachi, S. & Van der Linden, M. Individual differences in cognitive representations of action influence the activation of goal concepts. Acta Psychol.142, 259-264 (2013). [Google Scholar]
  • 54.Marsh, A. A. et al. The neural substrates of action identification. Soc. Cogn. Affect Neurosci.5, 392–403 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Baron-Cohen, S., Wheelwright, S., Skinner, R., Martin, J. & Clubley, E. The autism-spectrum quotient (AQ): Evidence from Asperger syndrome/high-functioning autism, males and females, scientists and mathematicians. J. Autism Dev. Disord.31, 5–17 (2001). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES