Skip to main content
Data in Brief logoLink to Data in Brief
. 2021 May 30;37:107184. doi: 10.1016/j.dib.2021.107184

Data on structure and farming practices of French organic vegetable farms, with focus on the use of inputs and the socio-economic context

Antonin Pépin a,b,, Kevin Morel c, Hayo MG van der Werf b
PMCID: PMC8207183  PMID: 34169126

Abstract

Organic vegetable farming systems in France have diverse farm structures, farming practices and socio-economic contexts. From April-July 2019, Pépin et al. [1] surveyed 165 farms using an online form. The questions about farming practices or socio-economic context did not require quantitative responses to make them simple and easy to answer. From a list of practices, farmers were asked which one(s) they used most often. Using decision rules, the answers were transformed into variables that are suitable for multivariate analysis. The data set also contains analysed data, including composite indexes derived from survey answers, as well as the number of the cluster to which each farm belonged, created after multivariate analysis and clustering performed on the data set.

Keywords: Organic farming, Vegetable farms, Farming diversity, Farming practice, Agroecology, Multivariate analysis


Specifications Table

Subject Agricultural Sciences
Specific subject area Organic vegetable production systems, Agroecology
Type of data Table
How data were acquired Online survey
Data format Raw and analysed data
Parameters for data collection The survey included only farms that produced organic vegetables for the fresh market as their main production. We focussed our survey on two contrasting regions of France: the north-west (Brittany, Normandy, Pays de la Loire) and south-east (Provence-Alpes-Côte-d'Azur, Languedoc-Roussillon). Because of word-of-mouth communication, however, some farmers in other regions answered the survey.
Description of data collection Data were collected using an online survey, made with Google Forms, sent to farmers from April-July 2019. The online survey was disseminated to the farmers through several networks − specialised in organic farming or not − including local agricultural development organisations and commercial organisations, to capture as many farm types as possible. Follow-up e-mails and phone calls were made regularly based on the responses collected, to ensure that sampling was as complete as possible.
Data source location Country: France
Data accessibility Raw and analysed data are deposited in a public repository
Repository name: INRAE dataverse (https://data.inrae.fr/)
Data identification number: 10.15454/YAXXYH
Direct URL to data: https://doi.org/10.15454/YAXXYH
Related research article A. Pépin, K. Morel, H.M.G. van der Werf, Conventionalised vs. agroecological practices on organic vegetable farms: investigating the influence of farm structure in a bifurcation perspective, Agricultural Systems 190, 103129. https://doi.org/10.1016/j.agsy.2021.103129

Value of the Data

  • The data set contains information on farming practices, with focus on the use of inputs and socio-economic issues in organic vegetable production, as well as farm and farmer characteristics.

  • The data set can be used by other researchers who work on organic practices in vegetable farming from an agro-ecological perspective.

  • These data can be used for future research on the relation between farm structure, farming practices in organic vegetable production and socio-economic elements.

  • This data set includes data from 165 farms which represent a diverse sample.

1. Data Description

The data reported in this data paper derive from a survey of farm structure, farming practices and socio-economic context conducted in France based on 165 organic vegetable farms. The data set is composed of 1 text document, 3 PDF documents and 6 Excel files that contain raw or analysed data (Table 1). It is hosted on INRAE repository and it includes the survey form, the answers as raw data and analysed data. Pépin et al. [1] provide details about the method used to analyse data. Answers that were open-ended or contained personal data (e.g. name, e-mail, phone number) were excluded from the answer files.

Table 1.

Contents of the data set.

File name Description
A0_README_description_of_files.txt Description of the files provided in the archive
A1_Codebook_Variable_information.csv Table that presents the variables and their short names, full names, type (quantitative or categorical), units, possible values and details
A2_Codebook_Categorical_Variable_values.csv Table that provides details about and explains the possible values taken by categorical values
B1_Survey_form_Fr.pdf Original survey form created with Google Forms (in French)
B2_Survey_form_Eng.pdf Survey form created with Google Forms, translated to English
C1_Survey_answers_Fr.csv Original survey answers (in French)
C2_Survey_answers_Eng.csv Survey answers, translated to English
D_Decision_rules.pdf Document explaining how the survey answers were transformed into variables in the data set
E_Dataset.csv Data set created based on the survey answers
F_Composite_indexes.csv Data set with the values of the composite indexes calculated according to Pépin et al. [1]

The results of the multivariate analysis conducted on the data set are presented in Table 2 and Fig. 1, Fig. 2, Fig. 3. It includes coordinates of the variables on the six principal components retained for the clustering, the correlation circle for the quantitative variables and the representation of the categories of the categorical variables on the first two principal components, and the associations between quantitative and categorical variables.

Table 2.

Coordinates of the variables on the six principal components.

Variable Dim.1 Dim.2 Dim.3 Dim.4 Dim.5 Dim.6
Years 0,4744189560 0,0002729537 0,0000089979 0,0055371181 0,0171702803 0,0484797620
Total_Area 0,4816120530 0,1951894910 0,0212584800 0,0000214685 0,0064296329 0,0016749360
Area_Field 0,4832810650 0,2516595948 0,0019664960 0,0025306635 0,0084659622 0,0266581040
Area_Sheltered 0,3640788340 0,2482546602 0,0856935000 0,0606381536 0,0008848295 0,0012093140
Manpower_FTE 0,6603791330 0,0120947450 0,0000826780 0,0615448804 0,0071853753 0,0155653500
Tractors 0,7219243690 0,0714604090 0,0204398900 0,0178606686 0,0026845719 0,0094871850
Ratio_Shelter_Field 0,0297155540 0,4469441867 0,1289005000 0,0187018880 0,0119505998 0,0008926270
Nb_Veg 0,2262670220 0,0241374533 0,2128739000 0,0108485159 0,0117312376 0,0205301750
Diversification 0,0090462030 0,0299649219 0,0009783325 0,0170704049 0,0030599415 0,2846180570
Fertilization 0,0191671450 0,3431179387 0,0283827600 0,0193806616 0,1888816132 0,0261087680
Tillage 0,2696908290 0,1371368889 0,4099002000 0,0512841738 0,1201344743 0,1305419100
Weeding 0,1364180930 0,0874620224 0,3041534000 0,3033157781 0,0692559995 0,0702891130
Pests_Diseases 0,2313476070 0,0991839132 0,0870709400 0,2884497764 0,1037189664 0,0771766340
Seeds_Seedlings 0,2111297320 0,1064088337 0,0780870600 0,0868591718 0,2238424723 0,1048908640
Food_Supply_Chain 0,4973274600 0,0604273794 0,2077630000 0,1879828808 0,3752937908 0,1597672070
Sales 0,7813459380 0,2424041787 0,3375564000 0,4108268194 0,2803084414 0,3520761180
Conv_Organic 0,2384361720 0,0490676290 0,0403923500 0,0017559832 0,0184547693 0,0409591350

Fig. 1.

Fig. 1

Correlation circle for the quantitative variables on the first two principal components.

Fig. 2.

Fig. 2

Representation of the categories of the categorical variables on the first two principal components.

Fig. 3.

Fig. 3

Associations between quantitative (blue) and categorical (red) variables on the first two principal components.

2. Experimental Design, Materials and Methods

Data were collected using an online survey sent to farmers from April-July 2019. The survey was carried out in two French regions with contrasting types of vegetable production: the north-west (Brittany, Normandy, Pays de la Loire) and south-east (Provence-Alpes-Côte-d'Azur, Languedoc-Roussillon).

The survey targeted farms that produced organic vegetables for the fresh market as their main production. The online survey, designed using Google Forms, was disseminated to farmers through several networks, including local agricultural development organisations and commercial organisations. Farmers completed 174 surveys, 165 of which were sufficiently complete to be included in the data set. In particular, we excluded six farms created in 2019 (i.e. less than one year of experience) and three farms that were not professional farms. Most farmers who answered the survey were located in the two targeted regions, but because of word-of-mouth communication, some farmers in other regions answered it.

The survey's questions focussed on farm structure, farming practices and socio-economic issues. The questions were divided into six categories:

  • -
    Farm history and geography
    • Farm age
    • Years since the farm was labelled “organic”
    • Location (administrative department)
  • -
    Land
    • Utilised agricultural area including non-cultivated areas
    • Area cultivated in vegetables, whether outdoors or sheltered (high plastic tunnels or multi-span greenhouses)
  • -
    Human and mechanical labour resources
    • Number of people working permanently or temporarily (labour)
    • Number of tractors
  • -
    Production
    • Number of different vegetables grown: farmers were asked to count the types of vegetables distinguished by consumers and marketing, regardless of their botanical species [2]. For example, cauliflower and kale are two different vegetables, as are green beans and dried beans. No distinction was made between varieties. Lettuce (e.g. Batavia, oakleaf) counted as one vegetable type. Potatoes and strawberries were considered vegetables.
    • Other types of production besides vegetables
  • -
    Farming practices
    • Type of tillage and tools used
    • Main practices to manage soil fertility
    • Main practices to control weeds
    • Main practices to control pests and diseases
    • Actions to protect or promote local biodiversity
    • Origin of seeds and seedlings
  • -
    Economy and selling strategy
    • Marketing supply chains
    • Destination of the vegetables sold (from local to export markets)
    • Annual revenue

As detailed online surveys that take too much time to fill out may deter the people targeted, the questions about farming practices or socio-economic context did not require quantitative responses, in order to make them easier to answer. In most cases, farmers were asked multiple-choice questions about which practices they used most often.

The answers were transformed into variables according to decision rules, as explained in Decision_rules.pdf. Missing answers were imputed using regularised iterative algorithms [1]. The variables and imputed values are available in Dataset.csv. The variables are suitable for statistical analyses such as multivariate analyses.

A subset of variables was transformed into normalised primary indicators. An additive combination of these indicators was calculated, which yielded composite indexes [1]. The values of these indicators and indexes for each farm are shown in Composite_indexes.csv.

Based on the data set, a farm typology was developed using Factor Analysis of Mixed Data and agglomerative hierarchical clustering (AHC) [1]. The resulting farm clusters are shown in Composite_indexes.csv and described by Pépin et al. [1]. Table 2 provides coordinates of the variables on the six principal components retained for the AHC.

Figs. 1 and 2 respectively represent the correlation circle for the quantitative variables and the categories of the categorical variables on the first two principal components. Fig. 3 represents the associations between quantitative and categorical variables.

Ethics Statement

All data were analysed anonymously. The farmers participated in the survey voluntarily and have agreed in writing to publication of the anonymised survey data for research purposes.

CRediT Author Statement

Antonin Pépin: Conceptualization, Methodology, Software, Writing - original draft preparation; Kevin Morel: Conceptualization, Writing - reviewing & editing; Hayo van der Werf: Conceptualization, Writing - reviewing & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.

Acknowledgments

The authors are grateful to the farmers who answered the survey and to the people who helped disseminate it.

Footnotes

Supplementary material associated with this article can be found in the online version at doi:10.1016/j.dib.2021.107184.

Appendix. Supplementary materials

mmc1.txt (1.7KB, txt)
mmc2.csv (4.8KB, csv)
mmc3.csv (5.5KB, csv)
mmc4.pdf (278.8KB, pdf)
mmc5.csv (132.1KB, csv)
mmc6.csv (109.8KB, csv)
mmc7.pdf (227.7KB, pdf)
mmc8.csv (28.6KB, csv)
mmc9.csv (6.6KB, csv)
mmc10.pdf (179.6KB, pdf)

References

  • 1.Pépin A., Morel K., van der Werf H.M.G. Conventionalised vs. agroecological practices on organic vegetable farms: investigating the influence of farm structure in a bifurcation perspective. Agric. Syst. 2021;190 doi: 10.1016/j.agsy.2021.103129. [DOI] [Google Scholar]
  • 2.Morel K., Leger F.G. A conceptual framework for alternative farmers' strategic choices: the case of French organic market gardening microfarms. Agroecol. Sustain. Food Syst. 2016;40:466–492. doi: 10.1080/21683565.2016.1140695. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.txt (1.7KB, txt)
mmc2.csv (4.8KB, csv)
mmc3.csv (5.5KB, csv)
mmc4.pdf (278.8KB, pdf)
mmc5.csv (132.1KB, csv)
mmc6.csv (109.8KB, csv)
mmc7.pdf (227.7KB, pdf)
mmc8.csv (28.6KB, csv)
mmc9.csv (6.6KB, csv)
mmc10.pdf (179.6KB, pdf)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES