Skip to main content
PLOS One logoLink to PLOS One
. 2022 Aug 15;17(8):e0272225. doi: 10.1371/journal.pone.0272225

Appropriability and basicness of R&D: Identifying and characterising product and process inventions in patent data

Sebastian Heinrich 1,¤,#, Florian Seliger 1,2,*, Martin Wörter 1,¤,#
Editor: George Vousden3
PMCID: PMC9377595  PMID: 35969608

Abstract

We present a database that classifies all patent applications filed at either the United States Patent and Trademark Office (USPTO) or the European Patent Office (EPO) as being either product patents, process patents or ‘mixed patents’. We use the share of claims that refer to either product or process inventions which allows to classify all patent applications along a continuum of pure process patents and pure product patents. We find that process-oriented patents draw more on previous knowledge, are more original and more radical than product patents. Lower breadth of protection is positively associated with pure process patenting, whereas product and mixed variants can be protected more broadly. This characterisation uncovers heterogeneity of patented inventions that allows for a more sophisticated use of patent statistics. It can improve the accuracy of analyses, but also reveal new aspects related to property rights.

Introduction

In this study, we present a categorization of patents according to whether they reflect product or process inventions. We identify keywords in patent claims that are related to process inventions, which allows us to classify the universe of European Patent Office (EPO) and United States Patent and Trademark Office (USPTO) patents. We also characterize process and product patents based on fundamental patent characteristics. In particular, we identify differences in product and process inventions by simultaneously regressing them on indicators capturing appropriability and basicness of R&D.

The distinction between product and process patents and their characterization contribute to several empirical and theoretical research strands in the field of economics.

For instance, the cost-saving function of process inventions is a cornerstone in industrial economics [1], especially when analyzing market power [2], market entry [3], and innovation behavior as a function of competitive conditions [310], or when analyzing the impact of cost-saving inventions on the labor market [11]. The distinction between process and product inventions in patent statistics thus allows to test more theoretical models empirically.

The differentiation between product and process technologies also plays a key role for technological life cycles. Utterback and Abernathy (1975) [12] described the stylized properties of such life cycles, where product inventions dominate at the beginning when firms compete with a variety of products. Firms first try to be competitive by introducing new products at an early stage of a life cycle. As technologies mature, they compete mainly on prices, while products are changed only incrementally. At this stage, competitive pressure induces firms to reduce production costs. In order to remain competitive and to sell into mass markets, firms then introduce process inventions. In the same vein, Klepper (1996) [13] showed that the ability to appropriate returns from process R&D depends on firm size. When firms grow in mature industries, they have an increased incentive to leverage process innovation. Finally, Pisano (1997) [10] showed that firms’ investment in new products are followed by investment in processes in order to decrease production costs in price competition.

Regarding patenting behavior, firms are assumed to follow different strategies for product and process technologies. According to Cohen and Klepper (1996) [13], firms typically do not sell or license process patents within their industry. One reason might be that process patents are less effective in protecting the underlying intellectual property. In general, the leaking of knowledge to competitors is slower as with product patents which reduces the incentive of patenting [14]. Hall et al. 2010 [15] provide a review on what determines firms’ choice between patenting and secrecy to protect their intellectual property. They conclude that innovations’ characteristics, namely product versus process, as well as discrete versus complex, along with competition in the product market, are key factors for the propensity of choosing patenting or secrecy.

Many empirical studies use the framework of a knowledge production function to investigate the returns from knowledge accumulation within a company [1619]. They often use patent data to measure the knowledge stock, lacking the distinction between product and process-related knowledge. Hence, it is an open question whether potentially positive returns from knowledge accumulation come from process or product-related knowledge. So far, there have been smaller scale empirical studies that distinguish between those two types of knowledge. They focus on specific technologies, on specific types of companies [20, 21] or use survey information [22].

Despite the important role of the distinction between product and process related knowledge in economic theory, technological life cycles, the knowledge production function, and for patent strategies, there are relatively few empirical studies investigating process inventions [10, 2326].

The negligence of process inventions in empirical studies mainly results from the lack of data. One of the few research efforts include Scherer (1982) [27], constructing a patent-based matrix for technology flows between industries. Cohen and Klepper (1996) [13] used the same data to study the distinction between product and process data. Furthermore, there are investigations based on survey data, studying the direct performance effects of product vs. process innovations [22, 28]. There are only a few recent studies that try to distinguish between products and processes in patent claims [2931].

The main contribution of this paper is to provide a comprehensive categorization of patents according to whether they represent product or process inventions. The patent classification includes patent documents from two major patent offices, the EPO and the USPTO. We make the data available to the public under the following DOI https://doi.org/10.7910/DVN/CBSK2W so that future research can draw on this comprehensive database. The proposed classification allows for more extensive studies analyzing the impact of product and process knowledge on firm performance and thus addresses small sample issues [19].

Our descriptive analysis shows that product patents are much more common than process patents, but ‘mixed’ patents (patents with both product and process claims) have become more important over time. In many technologies, mixed patents have become the predominant form of patenting already in the nineties. We see rather large differences across countries and technologies. This reveals previously undiscovered differences in countries’ technological capabilities and orientations, which can increase our understanding of the importance of technologies for countries’ economic development.

A further contribution refers to the investigation of the relationship between fundamental patent characteristics and the likelihood that a patent is a pure process patent or mixed patent rather than a product patent. Given that there is only a vague understanding of how process patents differ from product patents and its importance for economic thinking, it is crucial to gain more knowledge about the differences from an empirical point of view.

The characteristics we investigate are indicators that are commonly used in patent studies and are thus well-established. We mainly focus on appropriability and basicness of R&D. According to Trajtenberg et al. (1997) [32] they are prominent sources of heterogeneity in the characterisation of R&D. We therefore expect that they are also essential for the disctinction between product and process inventions. Basicness refers to features of innovations such as originality or closeness to science, whereas appropriability refers to the ability of inventors to reap the benefits from their inventions.

Our empirical study shows that patents with a higher share of process claims seem to be in younger technologies and have a broader technological scope compared to product patents. They draw on previous knowledge to a larger extent than product patents and are more radical and original. Finally, smaller breadth is associated positively with process patenting, but negatively with patenting mixed types, which is partly in contrast to our expectations.

The first section explains the applied method to distinguish between product and process patents. The second section presents descriptive results and shows how the number of product and process patents has evolved over time. In the third section, we describe the variables we use to identify important differences between patent types and present econometric results of a multinomial logit estimation. The last section presents conclusions from this study.

Identification of product and process patents

Patents have multiple claims (in most of the cases between 5 and 50), and we classified each claim individually as belonging to either the process or the product class. Claim texts for EPO patents were obtained from the EPO backfile containing EP-A and EP-B documents from 1978 to 2016 and 1980 to 2016, respectively (see https://www.epo.org/searching-for-patents/data/bulk-data-sets/data.html#tab-2, accessed on 2019/09/27). Claim texts for USPTO patents were obtained from the ‘Patent and Patent Application Claims Research Dataset’ provided on the USPTO’s Bulk Data Storage System containing full-text claims from US patents granted between 1976 and 2014 and US patent applications published between 2001 and 2014 (see https://bulkdata.uspto.gov/, accessed on 2019/09/27). We focus on USPTO and EPO patent documents as they belong to the five largest patent offices in the world, therefore covering a great share of global patent families. The EPO was founded in 1977, so data has only become available in the aftermath. The end year was also chosen due to data availability at the time we began to work on the project. The introduced classification scheme has been developed within the European Patent Office Academic Resarch Programme [33].

The set of keywords we use to classify claims were determined by a manual search process in EPO patents. The search was done by a patent expert from the Fraunhofer Institute in Germany. However, the identification of product patents by keywords is incomplete as in many cases the labeling of the specific product is used instead of more abstract terms. In contrast, the identification of processes is quite complete if using the extracted keywords. Since patent applications at the EPO can be filed in English, German or French, and not all patent filings are available in English, we had to conduct keyword searches in all three languages because we apply the keyword search on all EPO documents.

In contrast to economic theory, where a process invention is usually an improved process to produce a good (which mostly implies cost reductions), the definition of process claims is much broader. The EPO examination guidelines include “all kind of activities in which the use of some material product for effecting the process is implied” [34]. According to the USPTO guidelines, process claims “include a new use of a known process, machine, manufacture, composition or material” [35]. In order to account for those “use” claims, we identified additional keywords that refer to the usage of something. In this paper, our definition of process claims comprise both processes, methods and use claims. Our database enables researchers to distinguish between processes and usages explicitly, but the share of use claims is too tiny in order to be relevant for statistical analyses. The extracted keywords to distinguish between product and process claims for all three languages are listed in Table 1. The latter includes typical process plus use claims. For USPTO patents, we use the same set of keywords only in English.

Table 1. Keywords for classification into product and process claims.

Typical product keywords Process keywords Use keywords
English device method use
machine process utilization
material procedure utilisation
tool usage
apparatus
vehicle
compound
composition
substance
article
German Vorrichtung Verfahren Verwendung
Einrichtung Methode Anwendung
Werkzeug Prozess Benutzung
Material Prozedur Nutzung
Apparat
Fahrzeug
Verbindung
Zusammensetzung
Substanz
Artikel
French outil procédé utilisation
machine méthode usage
support procedure
materiel processus
dispositiv
assemblage
véhicule
composé
composition
substance
article

We implemented the classification as an exclusion process, meaning that every claim was checked for the occurrence of one of the process keywords. If one of the keywords occurs, the claim is considered as belonging to the process class. On the contrary, if a claim does not contain any of the process keywords, it is considered as belonging to the product class. This keyword search takes into account that the set of product keywords is incomplete and therefore does not use them explicitly. In this way, an unambiguous classification of claims is possible: If a claim is not a process claim, it must be a product claim by definition (see also the examination guidelines [34, 35]). So-called ‘product-by-process’ claims are product claims as well, but the difference to a standard product claim is that the product they relate to is defined by the process for producing it. The classification of claims offers a granular classification of each patent by considering that a substantial part of patents contains both products and processes.

In order to apply the keyword search, we had to apply some heuristic rules. Most importantly, we had to limit the search to the first words of a claim. Otherwise we would have classified many product claims as processes by mistake. Details and examples are provided in (S1 File). For process claims, we use two different sets of rules. In the first one, we limit the keyword search to the first two words. In the second, we limit the search to the first five. In turn, for use claims, we limit the search to the first word of a claim.

After the classification had been completed, we computed the total number of claims, the number of product, process, and use claims, as well as the share of process claims and use claims for each patent. These numbers serve as the basis for the identification of product and process patents.

In order to ensure the validity of the applied classification, we mainly did two things: First, we classified thousands of claims manually and compared the classification result with results from the automated keyword search. Second, we repeated the classification using machine learning. In order to apply machine learning techniques, research assistants who studied engineering or natural sciences classified the claims from about 1’100 patents that have been granted and were randomly drawn from the Worldwide Statistical Patent Database (PATSTAT). This served as our training data. For details, see Banholzer et al. (2019) [33]. We compared the accuracies from keyword search and text mining where the keyword approach yielded a slightly higher accuracy (98 compared to 93). By extensive manual inspection, we also came to the conclusion that the keyword search approach was superior in capturing the distinction between the different classes. However, the two measures appear to be correlated to a very large extent when comparing their outcomes.

Results from keyword search

The use share per patent is very small and does not change significantly over time. Therefore, we will add the use share to the process share in this paper. This is in line with the examination guidelines that treat claims referring to the use of something as process claims. As already mentioned, product-by-process claims are in fact product claims. Consequently, we treat them as product claims throughout the analysis, i.e. we add them to a patent’s product share.

Concerning the process share, there has been a trend towards including more process claims since 1990. The share amounts to about 30% nowadays (about 22% in 1980). There is only a slight difference between process shares based on keyword searches in the first two words and in the first five words. In this paper, we use the more restrictive definition based on only the first two words.

Definition of product and process patent categories

For the subsequent analyses, we use a rather restrictive definition of process and product patents: If a patent filing only contains product claims (i.e., the share of product claims is 1), it is considered a product patent. If it only contains process or use claims or process and use claims at the same time, it is defined as process patent. Patents with both product and process-use claims are defined as ‘mixed patents’. This definition is admittedly narrow and we lose information on the exact share of process and product claims in mixed patents because there is a continuum of possibilities between the two polar cases ‘pure’ process patents and ‘pure’ product patents. However, it has the advantage of being clear-cut and that we do not need to establish arbitrary thresholds (such as a 50% rule etc.).

Descriptive results

In this section, we provide descriptive results for the development of product, process-use and mixed patents based on patents granted at either the EPO or USPTO with priority years between 1980 and 2010.

Figs 1 and 2 show the number of granted product, process, and mixed patents at the EPO and USPTO, respectively. At the EPO, the share of pure product patents is around 50%, at the USPTO, it is lower. The share of pure process patents is generally much lower and slightly decreasing at both offices. In contrast, the number of mixed patents has increased considerably over time. It has caught up with the number of product patents in recent years.

Fig 1. Count of product, process, and mixed patents, granted at the EPO.

Fig 1

Fig 2. Count of product, process, and mixed patents, granted at the USPTO.

Fig 2

There are hardly any studies that we can refer to in order to validate our results. An exception to this is an EPO study on the ‘Market success for inventions’, which presents results from interviews with SMEs on the type of a specific patented invention [36]. The shares of pure product, pure process, and mixed patents are very close to our figures (according to the survey, 47% of the patent applications refer to pure product inventions, 38% to inventions combining product and process features, and 15% to pure process inventions). The survey results might increase confidence in the classification approach and help alleviate concerns about the use of claim text (e.g., because they might reflect the examiners’ point of view rather than the firms’ inventions).

Looking at the average number of claims per patent, we can see that the number has increased from about 9 to 12 at the EPO and from 11 to 17 at the USPTO from 1980 to 2010. Claims can be distinguished between independent and dependent claims. The difference is that a dependent claim cannot stand alone, this means it references another claim (independent claim) that is directed to the essential features of the invention. Interestingly, the number of independent claims, which are the claims that can stand alone, has not changed by much; especially at the EPO, it does not show any upward trend. The increase in the number of dependent claims might be driven by strategic reasons, such that firms are trying to make their patents as broad and vague as possible in order to sue competitors that infringe the patent, or by legal requirements at the patent offices. Therefore, indicators based on independent claims might get closer to ‘true’ product or process shares of inventions. The share of product patents calculated based on only independent claims is higher at the EPO (between 61% and 67%, Fig 3). At the USPTO, product patents have lost their predominant position, but they are still more important than mixed patents when looking at only independent claims (Fig 4).

Fig 3. Count of product, process-use, and mixed patents, EPO, based on independent claims.

Fig 3

Fig 4. Count of product, process-use, and mixed patents, USPTO, based on independent claims.

Fig 4

There are also some interesting differences across inventor countries and technologies (see [33]). For complex technologies such as computer technology, the descriptive evidence delivers a clear picture: Mixed patents have become predominant, whereas product patents loose in significance. Pure process patents only play a minor role. The growing importance of computer technology in the U.S. might be a reason why we see such a large decrease in product patents at the USPTO.

The reasons for those developments remain unclear and require more detailed analyses, but it is likely that technologies have become more complex over time or that firms add additional process claims in order to fulfil legal requirements or for strategic reasons. The increase in mixed patents might be hardly attributed to cost reductions or improvements in the production process alone.

For technologies that are characterized by very high shares of product patents such as transport technologies, there is also a trend towards more mixed patents (and less product patents), perhaps indicating technological exhaustion within a technological paradigm [37], but the share of product patents is still much higher.

The relationship between fundamental patent characteristics and product and process patenting

Data and variables

In this section, we investigate the relationship between fundamental patent characteristics and the likelihood of getting a pure process-use or mixed patent granted in comparison to the likelihood of getting a pure product patent granted at either the EPO or USPTO. This analysis enables us to understand how product and process patenting is related to appropriability, basicness, the state of the technological life cycle, and technological scope. The used variables originate from the OECD Patent Quality Database [38], PATSTAT, and the database we have developed as outlined in this study. We provide an overview of our variables and technical description, as well as their sources in Table 2 and provide a summary of their intuition in the following paragraphs.

Table 2. List of variables.

Variable name Source Description
product_process_mixed Own data Categorical variable, 1 = product patent, 2 = process patent, 3 = mixed patent
product_process_mixed_ind Own data Categorical variable, 1 = product patent, 2 = process patent, 3 = mixed patent, based on independent claims
lnclaims_ind Own data Number of independent claims, natural logarithm
claims_per_ind_claim Own data Number of dependent claims per independent claim
lnpatent_scope OECD Patent Quality Number of distinct 4-digit IPC subclasses, natural logarithm
lnfamily_size OECD Patent Quality Number of patent offices at which a given invention has been protected normalised with respect to the maximum value exhibited by other patents in the same cohort, with cohorts that are determined by the pair technology–year, natural logarithm
lnnpl_cits OECD Patent Quality Number of NPL citations included in a patent divided by the maximum number of NPL ciations of patents belonging to the same year and technology cohort+1, natural logarithm
lnbwd_cits OECD Patent Quality Number of backward citations included in a patent divided by the maximum number of backward ciations of patents belonging to the same year and technology cohort+1, natural logarithm
originality OECD Patent Quality Between 0 and 1, the measure is high if a patent cites patents belonging to a wide range of IPC codes
radicalness OECD Patent Quality Count of IPC-4 digit codes of patent j cited in patent p that is not allocated to patent p, out of n IPC classes in the backward citations counted at the most disaggregated level available, normalised with respect to the total number of IPC classes listed in the backward citations
upward_tech_cycle Own data Patent was filed in a technology when technology was growing (1), when it was stable (0), when it was decreasing (-1)
lntech_age Own data Age of the technology when the patent was filed+1, natural logarithm
lnnumb_words_ind Own data Average number of word stems in independent claims, natural logarithm
lnnb_inventors PATSTAT Number of inventors
USPTO Own data Patent application was filed at the USPTO

The following variables, namely patent scope, family size, citations to non-patent literature, backward citations, originality and radicalness come from the quality indicators provided by the OECD [38]). They measure different dimensions of basicness and patent value. Patent scope is measured by distinct 4-digit IPC subclasses. A wider scope has been shown to affect valuations of firms where broader patents are more valuable when there are many available substitutes in a particular product class [39]. Family size is measured by the number of patent offices where an invention is protected where larger families are associated with higher values [40]. Non-patent literature citations can be applied to asses the extent to which a patent relies on scientific research. Patents relying more on scientific research are associated with significantly higher quality [41]. The number of backward citations can be used to assess the novelty of an invention disclosed in a patent and was also found to be positively associated with patents’ value [40]. Originality is measured by the breadth of cited IPC codes and thus measures how diversified the knowledge sources are [32]. Finally, the radicalness index proposed by Shane (2001) [42] is based on IPC classes that deviate from classes in the cited patents and thus measures the paradigm shift a patent brings about.

Furthermore, we construct a variable intended to capture technological life cycles in the context of product and process patenting. Within a prototypical technological life cycle, a firm devotes more and more effort into process inventions over time [12]. Our upward technology cycle variable indicates whether a patent belongs to a growing technology as measured by the growth of the number of patent applications in the respective technology at a given point in time. Another variable measures the age of a technology, i.e. the difference between the filing date of the respective patent and the first patent that appeared in a given technology. Technologies are defined as unique combinations of 4-digit codes from the International Patent Classification. A detailed explanation can be found in (S1 File).

Based on our own data, we use the number of independent claims to measure a patent’s breadth in terms of legal protection. As a higher number of claims incurs higher patent fees [38], they not only account for technical breadth, but also for applicants’ expectations of market value [43, 44]. We also add the number of dependent claims per independent claim which might reflect strategic considerations of applicants.

A further variable refers to the average number of words in the independent claims. According to Kuhn & Thompson (2019) and Marco, Sarnoff, & deGrazia (2019) [45, 46], this is a preferred measure of patent protection breadth. The process of claim narrowing during the examination procedure almost always involves adding words to the claim, i.e., a higher number of words indicates a lower patent breadth. In this paper, we refer to patent breadth in the sense of the breadth of legal protection and to patent scope in the sense of the technological scope of the invention.

Finally, we use the team size of inventors to explore potential distinctions in human capital between the two categories. Akcigit et al. (2018) [47] showed that the average team size is 2.3 inventors. Inventor teams are likely to generate more spillovers [48], reduce the cost of accessing shared resources [49], and ultimately increase productivity, especially if policies support the quality of inventor teams [50].

We also add a dummy indicating whether a patent has been filed at the USPTO (rather than at the EPO) and the respective inventor country weights and technology field weights. The reason that we add weights rather than dummies is that each patent can belong to multiple inventor countries and technologies. The weights capture the share of inventors in each country and the share of IPC codes belonging to a certain technological field accurately. Finally, we add filing year fixed effects.

Literature on patent quality and appropriability is rather silent with respect to the difference between process and product patents. Most importantly, patent protection of processes is much more difficult than patent protection of products and secrecy is often used as an appropriation mechanism for process inventions [15]. Therefore, the breadth of legal protection should be smaller for process and mixed patents patents compared to products patents. On the other hand, we would expect process and mixed patents to be slightly more complex as they dominate in so-called complex technologies. Hence, they should draw on science and predecessor inventions to a larger extent than product patents. They might also cover a larger number of technological fields, might have more claims and a larger number of patent family members. Adding process claims to product inventions and filing mixed patent applications can of course have different implications, e.g. that they are filed for strategic reasons.

Table 3 provides descriptive statistics for the variables used characterize our classification. The dependent variable is a categorical variable (product, process, or mixed patent), containing 51.91% product, 12.25% process and 35.84% mixed patents. When only considering independent claims, the variable contains 57.96% product, 13.95% process and 28.09% mixed patents.

Table 3. Descriptive statistics of variables.

Variable name Observations Mean sd min max
product_process_mixed 4993150 1.839291 .9228789 1 3
product_process_mixed_ind 4992865 1.70132 .8782247 1 3
lnclaims_ind 5008417 .6717842 .6288613 0 5.442418
claims_per_ind_claim 5008417 7.423275 5.432932 1 887
lnpatent_scope 4995389 .4810146 .524585 0 3.688879
lnfamily_size 4995447 1.140025 .8218295 0 4.025352
lnbwd_cits 4995447 2.279622 .9111179 0 8.467373
lnnpl_cits 4995447 .5459317 .9279244 0 7.301148
originality 4890962 .7088248 .2202669 0 .9938309
radicalness 4893208 .3544737 .2700929 0 1
upward_tech_cycle 4929540 .737313 .6754943 -1 1
lntech_age 4929735 2.683784 .8805059 0 3.555348
lnnumb_words_ind 5008415 3.657549 .3697721 0 7.986845
lnnb_inventors 5008495 1.129451 .4239406 0 4.343805
USPTO 5008495 .7765179 .4165788 0 1

Methods

Our method to investigate the relationship between the introduced patent characteristics and the patent classification contains two steps. First, we reduce the number of patent characteristics by applying a factor analysis. Second, we use the resulting factors in a multinomial logit model (see [51]) to estimate their relationship with our three patent categories. The following subsection describes the process of extracting the latent factors and their use in the multinomial logit model.

Factor analysis and multinomial logit model

With the goal of higher interpretability of the subsequent multinomial logit model, we aim at reducing the number of variables included. We therefore apply a factor analysis (see for example [52]) to find common factors of the patent characteristics introduced above. Our justification to apply a factor model to the patent characteristics relies on the fact that our candidate variables can have a high degree of overlap in their measurement and intuition. For example, a couple of variables rely on claims or citations. This can lead to multicollinearity problems in the estimates, which complicates the interpretation of the results. To avoid this problem, we introduce the orthogonal factors from the factor analysis into the multinomial logit estimation.

We apply the Kaiser criterion [53] and drop all factors with eigenvalues smaller than 1.0 (see the scree plot in Fig 5).

Fig 5. Scree plot with eigenvalues.

Fig 5

This leaves us with four factors that are presented in Table 4, which shows the rotated factor loadings for the variables lnclaims_ind, claims_per_ind_claim, lnpatent_scope, lnbwd_cits, lnnpl_cits, lntech_age, originality and radicalness. The absolute loadings bigger than 0.35 are marked in bold, indicating the variables with the strongest loadings on the respective factor.

Table 4. Rotated factor loadings (pattern matrix) and unique variances.
Variable Factor 1 Factor 2 Factor 3 Factor 4 Uniqueness
lnclaims_ind 0.3401 0.0510 -0.0398 -0.7662 0.2931
claims_per_ind_claim 0.1677 0.0080 -0.0084 0.8633 0.2264
lnpatent_scope 0.1889 0.0407 0.8631 0.0394 0.2161
lnbwd_cits 0.7635 0.2328 -0.0800 -0.0585 0.3530
lnnpl_cits 0.8066 -0.0108 0.0608 0.0039 0.3456
lntech_age 0.2744 -0.0072 -0.7896 0.0249 0.3005
originality 0.2900 0.8011 0.2360 -0.0071 0.2185
radicalness -0.0571 0.9030 -0.1093 -0.0202 0.1689

Absolute loadings > 0.35 in bold

In the subsequent estimation, the four factors are named as follows: “Citations”, with the strongest factor loadings on lnbwd_cits and lnnpl_cits, “originality and radicalness” with the strongest factor loadings on originality and radicalness, “age and scope” with the strongest factor loadings on lnpatent_scope and lntech_age, and “claims”, with the strongest loadings on lnclaims_ind and claims_per_ind_claim. We also estimated additional models, including the full set of our variables and evaluated them against technical criteria (Kaiser criterion) as well as the intuition of the resulting factors.

We apply a multinomial logit estimator to distinguish the importance of the factors for our categorization outcome, specified in Eq (1). In our specification, we use the dependent variable y ∈ (0, 1, 2) to represent the outcomes product patents (j = 0), process patents (j = 1) and mixed patents (j = 2), where product patents (j = 2) are the base category.

Prob(Yi=j|xi)=exp(xiβj)j=02exp(xiβj)),j=0,1,2 (1)

We use the estimations from the above factor model as variables in our multinomial logit estimation with robust standard errors, in the sense of a principle component regression (see e.g. [54, 55]). Referring to Eq (1), x is a vector containing the estimations from our factor model and additional independent variables considered to be relevant to explain the classification outcome. We estimate a reduced (Table 5) and a full specification (Table 6), the latter containing the additional variables left out from the factor model. All estimations include time dummies, inventor countries and technology weights. We refrain from reporting tests on the independence of irrelevant alternatives because those tests are susceptible to size bias and model complexity, which applies to our estimates because they are based on millions of observations (patents) and a large number of independent variables [56].

Table 5. Multinomial logit reduced model—dependent variable: Patent category, base category: product patent.
categorization based on all claims categorization based on only independent claims
process patent (1) mixed patent (2) process patent (3) mixed patent (4)
Citations 0.0877 *** 0.282 *** 0.0355 *** 0.231 ***
(0.00212) (0.00149) (0.00196) (0.00155)
Originality and Radicalness 0.113 *** 0.101 *** 0.108 *** 0.110 ***
(0.00172) (0.00120) (0.00161) (0.00131)
Age and Scope 0.109 *** 0.0939 *** 0.0985 *** 0.0632 ***
(0.00175) (0.00130) (0.00162) (0.00141)
Claims 0.0332 *** -0.660 *** 0.101 *** -1.153 ***
(0.00156) (0.00160) (0.00154) (0.00183)
USPTO 0.0647 *** -0.859 *** 0.00450 -0.713 ***
(0.00462) (0.00329) (0.00406) (0.00372)
Constant -0.677 *** 0.628 *** -0.572 ** 0.147
(0.195) (0.160) (0.178) (0.168)
Observations 4800465 4800110
Pseudo R2 0.186 0.204
chi2 1259919.8 1343087.3

Robust standard errors in parentheses.

* p < 0.05,

** p < 0.01,

*** p < 0.001. Time dummies, inventor country and technology shares included.

Table 6. Multinomial logit full model—dependent variable: Patent category, base category: product patent.
categorization based on all claims categorization based on only independent claims
process patent (1) mixed patent (2) process patent (3) mixed patent (4)
Citations 0.0691 *** 0.275 *** 0.0228 *** 0.226 ***
(0.00217) (0.00152) (0.00201) (0.00160)
Originality and Radicalness 0.112 *** 0.0923 *** 0.109 *** 0.102 ***
(0.00172) (0.00121) (0.00161) (0.00132)
Age and Scope 0.0955 *** 0.0874 *** 0.0889 *** 0.0544 ***
(0.00183) (0.00135) (0.00169) (0.00146)
Claims 0.0283 *** -0.652 *** 0.0973 *** -1.145 ***
(0.00156) (0.00161) (0.00155) (0.00184)
USPTO 0.0835 *** -0.851 *** 0.0122 ** -0.696 ***
(0.00478) (0.00343) (0.00422) (0.00388)
upward_tech_cycle -0.0648 *** -0.00653 *** -0.0561 *** 0.000445
(0.00236) (0.00180) (0.00218) (0.00197)
lnfamily_size 0.00263 -0.0193 *** -0.00493 * 0.00813 ***
(0.00236) (0.00173) (0.00220) (0.00185)
lnnb_inventors 0.251 *** 0.200 *** 0.182 *** 0.0915 ***
(0.00392) (0.00288) (0.00366) (0.00311)
lnnumb_words_ind 0.0513 *** -0.470 *** 0.135 *** -0.545 ***
(0.00438) (0.00354) (0.00402) (0.00371)
Constant -1.552 *** 1.712 *** -1.515 *** 1.703 ***
(0.196) (0.161) (0.179) (0.169)
Observations 4800271 4799916
Pseudo R2 0.189 0.208
chi2 1272444.8 1350628.1

Robust standard errors in parentheses.

* p < 0.05,

** p < 0.01,

*** p < 0.001.

Time dummies, inventor country and technology shares included.

Results

Tables 5 and 6 show coefficients for the specification with only factor scores and with other important characteristics, respectively. The results presented in here use product patents as base category. Estimates based on process and mixed patents as base categories can be found in the (S1 Table).

If a patent applicant or inventor were to add more citations of prior art, to invent more original or radical patents, or expand the technological scope in younger technologies, the multinomial log-odds for process and mixed patents relative to product patents would be expected to increase while holding all other variables in the model constant. A higher number of backward citations (Citations), higher originality and radicalness, and technological scope, as well as a younger technology (Age and Scope) thus increase the likelihood of getting either a process or mixed patent granted compared to the likelihood of getting a product patent. The association between Citations is particularly strong with mixed patents. This indicates that a higher reliance on predecessor inventions and research mainly leads to more mixed patents rather than product patents. The strong association between originality and radicalness with both process and mixed patents suggests that they are indeed more complex than pure product patents, because they not only refer to a broader range of IPC codes in their backward citations, but also deviate from prior technological paradigms. The result for technological scope points in the same direction. As the factor loading for technological age is negative, the estimation results suggest that the younger the technology, the more likely the granted patent is mixed or a process one. Thus, process and mixed patents are not only younger than product inventions, they also have a higher degree of novelty, which is most likely associated with higher risk in their development. Given the descriptive results above showing that the proportion of mixed patents has increased, these estimation results suggest that invention risk has also increased over time, at least in some technologies.

Claims is the only factor that shows a negative relationship with mixed patents, but a positive with process patents. The more dependent claims per independent claim a patent has, the more likely it will be a pure process patent, and the less likely a mixed patent compared to being a product patent. Conversely, the more independent claims a patent has, the less likely it will be a pure process patent, and the more likely a mixed patent. Even though the results must be interpreted with caution, a higher number of dependent claims per independent claims might reflect either high complexity, requirements from the examiners or strategic considerations. Van Zeebroeck, de la Potterie, & Guellec (2009) studied the contribution of the diffusion of national practices, technological complexity, emerging sectors and patenting strategies in explaining the number of claims of EPO patents [57]. Even though all elements are important, they found that institutional influences and the international harmonization are the most important factors. Finally, filing a patent application at the USPTO reduces the likelihood of getting a mixed patent.

In an extended specification, we include further important variables to characterize the different patent categories. Filing a patent application in the upward cycle of a technology reduces the likelihood of getting a process patent granted. This seems to be in line with the theoretical framework that firms draw on processes at a later stage of the product cycle in order to sell into mass markets. But it is at odds with the result that younger technologies are associated with more process inventions. However, it could also indicate that process patents are more frequently found in technologies with shorter product life cycles, where technological maturity occurs earlier. This means that the product development opportunities are exhausted very soon and process inventions gain in importance. Furthermore, it could well be that patenting in younger technologies only in combination with technological complexity explains the strong correlation with process patenting, as indicated by the age and scope factor.

The size of the patent family is often associated with the commercialization potential of the invention. The positive correlation between mixed patents and family size in the case of independent claims and the negative correlation when all claims are classified may therefore indicate that mixed patents only have larger commercialization potential if both product and process characteristics are covered in the independent claims. In turn, adding process elements in the dependent claims seems to be associated with less commercialisation potential.

The size of the inventor team makes both process and mixed patents more likely. This might also reflect the findings that complex technologies such as software developments are typically based on larger teams as knowledge about those technologies can be transferred rather easily (for example, code can be shared on repositories). Finally, a higher number of words per independent claim decreases the likelihood of process patents and increases the likelihood of mixed patents. To recall, a higher number of words indicates less patent protection breadth. Therefore, pure process patents might be more difficult to protect than product inventions, which comes at no surprise and is mentioned in most survey articles on product and process innovations [15]. The result for mixed patent suggests that they can be protected easier than product patents which is much more surprising.

Conclusions

The goal of this paper is to introduce a comprehensive classification of product and process patents based on keywords in claims and to provide descriptive evidence on potential relationships with fundamental patent characteristics. We show that the share of patents that contain both product and process claims (mixed patents) has increased tremendously. The share of pure process patents is still rather low. The trend towards including more and more process claims in addition to product claims can have many reasons, such as a larger complexity of the underlying technologies, a general technological exhaustion, strategic patenting behavior of large firms, or specific requirements from the patent offices and examiners. Generally, we observe large differences across technologies with computer technology having the largest share of mixed patents. As the patent premium also fluctuates widely across industries [58], such fluctuations might be also related to the industry’s fraction of mixed and particularly process patents.

The resulting database can be of great use in order to study different aspects of the R&D and patenting process in more detail. For example, it enables researchers to distinguish between different types of knowledge accumulation in the framework of a knowledge production function, and to conduct more in-depth industry studies.

The characterization of product, process and mixed patents is an important contribution to different strands of empirical and theoretical literature and can contribute to a better understanding of R&D processes in companies. In particular, our analysis of relationships between fundamental patent characteristics and the different patent categories shows that younger technologies and potentially more complex technology combinations are associated with more process and mixed patents rather than product patents. Especially mixed patents seem to draw on already available knowledge and might be thus more science-based, original and radical than pure product patents. We do not find evidence that mixed patents are filed for mainly strategic reasons. A higher number of dependent claims per independent claim which may indicate strategic considerations or legal requirements seems to be associated with more process patents, but with less mixed patents. Our analysis confirms the notion that process inventions are more difficult to protect than product inventions, which is the reason why for many processes secrecy or lead-time advantages are the preferred appropriation mechanism.

A further finding is that larger inventor teams are more likely to engage in process-related research projects. Since the size of inventor teams is positively correlated with the likelihood of filing process patents rather than product or mixed patents, the positive relationship between inventor teams and performance which other studies found could be related to the fact that teams are more likely to develop process inventions. Thus, this study points to a previously neglected factor that could improve our understanding of the role of inventor teams in productivity. A plausible reason why larger inventor teams are more likely to engage in process-related research projects might be that processes are at the same time more complex and riskier than product inventions. Consequently, they might also have a greater market potential if the research project is successful which then translates into higher performance.

The study is subject to some limitations. First, in general, patents are not the preferred mechanism to appropriate newly created knowledge, especially for smaller firms [59, 60]). This is especially true for process inventions and implies that most of them remain unobservable. Second, although we attained robust classification results which were confirmed by robustness tests with different keyword combinations, we cannot completely rule out the possibility that additional keywords could slightly alter the results. Third, with only three patent types (product, process and mixed) that are used in our analysis we admittedly loose a lot of valuable information. Further research might try to uncover more nuances with respect to patents having different shares of process and product claims.

Further research might also develop and apply more advanced text mining and scraping methods and perhaps extend the scope beyond patent data. For example, product data on firms’ websites or protocols from R&D labs might deliver more accurate pictures of the amount of process inventions. Finally, it is of particular interest to study diminishing returns to R&D given that it is getting harder to invent commercially successful technologies [61], a phenomenon that has caught attention in recent years. Our patent classification might provide useful metrics in order to study it.

Supporting information

S1 File. Appendix.

This appendix describes the implementation of the keyword search, the identification of independent and product-by-process claims, and the construction of a measure for growing technologies.

(PDF)

S1 Table. Regression results with different basis.

These tables contain the results from the multinomial logit model with different base categories.

(PDF)

Data Availability

The data are available to the public on Harvard Dataverse (DOI: https://doi.org/10.7910/DVN/CBSK2W).

Funding Statement

We are grateful to the European Patent Office Academic Research Programme for the financial support of this project. Part of Sebastian Heinrich’s salary was funded by this grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Tirole J. The theory of industrial organization. Cambridge, MA: MIT press; 1988. [Google Scholar]
  • 2. Reksulak M, Shughart WF, Tollison RD. Innovation and the opportunity cost of monopoly. Managerial and Decision Economics. 2008;29(8):619–627. doi: 10.1002/mde.1425 [DOI] [Google Scholar]
  • 3. Vives X. Innovation and competitive pressure. The Journal of Industrial Economics. 2008;56(3):419–469. doi: 10.1111/j.1467-6451.2008.00356.x [DOI] [Google Scholar]
  • 4. Rosenkranz S. Simultaneous choice of process and product innovation when consumers have a preference for product variety. Journal of Economic Behavior & Organization. 2003;50(2):183–201. doi: 10.1016/S0167-2681(02)00047-1 [DOI] [Google Scholar]
  • 5. Boone J. Competitive pressure: the effects on investments in product and process innovation. The RAND Journal of Economics. 2000; p. 549–569. doi: 10.2307/2601000 [DOI] [Google Scholar]
  • 6. Bonanno G, Haworth B. Intensity of competition and the choice between product and process innovation. International Journal of Industrial Organization. 1998;16(4):495–510. doi: 10.1016/S0167-7187(97)00003-9 [DOI] [Google Scholar]
  • 7. Lin P, Saggi K. Product differentiation, process R&D, and the nature of market competition. European Economic Review. 2002;46(1):201–211. doi: 10.1016/S0014-2921(00)00090-8 [DOI] [Google Scholar]
  • 8. Beneito P, Coscollá-Girona P, Rochina-Barrachina ME, Sanchis A. Competitive pressure and innovation at the firm level. The Journal of industrial economics. 2015;63(3):422–457. doi: 10.1111/joie.12079 [DOI] [Google Scholar]
  • 9. Tang J. Competition and innovation behaviour. Research policy. 2006;35(1):68–82. doi: 10.1016/j.respol.2005.08.004 [DOI] [Google Scholar]
  • 10. Pisano GP. The development factory: unlocking the potential of process innovation. Harvard Business Press; 1997. [Google Scholar]
  • 11. Bena J, Ortiz-Molina H, Simintzi E. Shielding firm value: Employment protection and process innovation. Journal of Financial Economics. 2021;. doi: 10.1016/j.jfineco.2021.10.005 [DOI] [Google Scholar]
  • 12. Utterback JM, Abernathy WJ. A dynamic model of process and product innovation. Omega. 1975;3(6):639–656. doi: 10.1016/0305-0483(75)90068-7 [DOI] [Google Scholar]
  • 13. Klepper S. Entry, exit, growth, and innovation over the product life cycle. The American economic review. 1996;86(2):562–583. [Google Scholar]
  • 14. Levin RC, Klevorick AK, Nelson RR, Winter SG, Gilbert R, Griliches Z. Appropriating the returns from industrial research and development. Brookings papers on economic activity. 1987;1987(3):783–831. doi: 10.2307/2534454 [DOI] [Google Scholar]
  • 15. Hall B, Helmers C, Rogers M, Sena V. The choice between formal and informal intellectual property: a review. Journal of Economic Literature. 2014;52(2):375–423. doi: 10.1257/jel.52.2.375 [DOI] [Google Scholar]
  • 16. Griliches Z. Issues in assessing the contribution of research and development to productivity growth. The bell journal of economics. 1979; p. 92–116. doi: 10.2307/3003321 [DOI] [Google Scholar]
  • 17. Hall BH, Mairesse J, Mohnen P. Measuring the returns to R&D. In: Hall BH, Rosenberg N, editors. Handbook of the Economics of Innovation. vol. 2. Amsterdam: Elsevier B.V.; 2010. p. 1033–1082. [Google Scholar]
  • 18. Mairesse J, Hall BH. Estimating the productivity of research and development: An exploration of GMM methods using data on French & United States manufacturing firms. NBER working paper. 1996;(w5501). [Google Scholar]
  • 19. Ugur M, Trushin E, Solomon E, Guidi F. R&D and productivity in OECD firms and industries: A hierarchical meta-regression analysis. Research Policy. 2016;45(10):2069–2086. doi: 10.1016/j.respol.2016.08.001 [DOI] [Google Scholar]
  • 20. Lunn J. An empirical analysis of firm process and product patenting. Applied Economics. 1987;19(6):743–751. doi: 10.1080/00036848700000106 [DOI] [Google Scholar]
  • 21. Bena J, Simintzi E. Globalization of work and innovation: Evidence from doing business in china. Available at SSRN. 2017;. [Google Scholar]
  • 22. Griffith R, Huergo E, Mairesse J, Peters B. Innovation and productivity across four European countries. Oxford review of economic policy. 2006;22(4):483–498. doi: 10.1093/oxrep/grj028 [DOI] [Google Scholar]
  • 23. Adner R, Levinthal D. Demand heterogeneity and technology evolution: implications for product and process innovation. Management science. 2001;47(5):611–628. doi: 10.1287/mnsc.47.5.611.10482 [DOI] [Google Scholar]
  • 24. Hatch NW, Mowery DC. Process innovation and learning by doing in semiconductor manufacturing. Management Science. 1998;44(11-part-1):1461–1477. doi: 10.1287/mnsc.44.11.1461 [DOI] [Google Scholar]
  • 25. Reichstein T, Salter A. Investigating the sources of process innovation among UK manufacturing firms. Industrial and Corporate change. 2006;15(4):653–682. doi: 10.1093/icc/dtl014 [DOI] [Google Scholar]
  • 26. Trantopoulos K, von Krogh G, Wallin MW, Woerter M. External knowledge and information technology: Implications for process innovation performance. MIS quarterly. 2017;41(1):287–300. doi: 10.25300/MISQ/2017/41.1.15 [DOI] [Google Scholar]
  • 27. Scherer FM. Inter-industry technology flows and productivity growth. The Review of Economics and Statistics. 1982;64(4):627–634. doi: 10.2307/1923947 [DOI] [Google Scholar]
  • 28. Hall B, Helmers C, Rogers M, Sena V. The choice between formal and informal intellectual property: a review. Journal of Economic Literature. 2014;52(2):375–423. doi: 10.1257/jel.52.2.375 [DOI] [Google Scholar]
  • 29. Bena J, Ortiz-Molina H, Simintzi E. Shielding firm value: Employment protection and process innovation. 2018. 10.2139/ssrn.3223176 [DOI] [Google Scholar]
  • 30. Bena J, Simintzi E. Machines could not compete with Chinese labor: Evidence from US firms’ innovation. 2019. 10.2139/ssrn.2613248 [DOI] [Google Scholar]
  • 31. Ganglmair B, Reimers I. Visibility of Technology and Cumulative Innovation: Evidence from Trade Secrets Laws. 2019;. [Google Scholar]
  • 32. Trajtenberg M, Henderson R, Jaffe A. University versus corporate patents: A window on the basicness of invention. Economics of Innovation and new technology. 1997;5(1):19–50. doi: 10.1080/10438599700000006 [DOI] [Google Scholar]
  • 33. Banholzer N, Behrens V, Feuerriegel S, Heinrich S, Rammer C, Schmoch U, et al. Knowledge spillovers from product and process inventions in patents and their impact on firm performance. End report; 2019. [Google Scholar]
  • 34.European Patent Office. Guidelines for Examination in the European Patent Office. Munich: EPO; 2017. Available from: https://www.epo.org/law-practice/legal-texts/guidelines.html.
  • 35.United States Patent and Trademark Office. Manual of Patent Examining Procedure—Chapter 2100 Patentability. USPTO; 2015. Available from: https://www.uspto.gov/web/offices/pac/mpep/mpep-2100.html.
  • 36. European Patent Office. Market success for inventions—Patent commercialisation scoreboard: European SMEs. Munich; 2019. [Google Scholar]
  • 37. Dosi G. Technological paradigms and technological trajectories. A suggested interpretation of the determinants and directions of technical change. Research Policy. 1982;11(3):147–162. doi: 10.1016/0048-7333(82)90016-6 [DOI] [Google Scholar]
  • 38.Squicciarini M, Dernis H, Criscuolo C. Measuring patent quality. 2013;.
  • 39. Lerner J. The importance of patent scope: an empirical analysis. The RAND Journal of Economics. 1994; p. 319–333. doi: 10.2307/2555833 [DOI] [Google Scholar]
  • 40. Harhoff D, Scherer FM, Vopel K. Citations, family size, opposition and the value of patent rights. Research policy. 2003;32(8):1343–1363. doi: 10.1016/S0048-7333(02)00124-5 [DOI] [Google Scholar]
  • 41. Branstetter L. Exploring the link between academic science and industrial innovation. Annales d’Economie et de Statistique. 2005; p. 119–142. doi: 10.2307/20777572 [DOI] [Google Scholar]
  • 42. Shane S. Technological opportunities and new firm creation. Management science. 2001;47(2):205–220. doi: 10.1287/mnsc.47.2.205.9837 [DOI] [Google Scholar]
  • 43. Tong X, Frame JD. Measuring national technological performance with patent claims data. Research Policy. 1994;23(2):133–141. doi: 10.1016/0048-7333(94)90050-7 [DOI] [Google Scholar]
  • 44. Lanjouw JO, Schankerman M. Patent quality and research productivity: Measuring innovation with multiple indicators. The Economic Journal. 2004;114(495):441–465. doi: 10.1111/j.1468-0297.2004.00216.x [DOI] [Google Scholar]
  • 45. Kuhn JM, Thompson NC. How to measure and draw causal inferences with patent scope. International Journal of the Economics of Business. 2019;26(1):5–38. doi: 10.1080/13571516.2018.1553284 [DOI] [Google Scholar]
  • 46. Marco AC, Sarnoff JD, Charles AW. Patent claims and patent scope. Research Policy. 2019; p. 103790. doi: 10.1016/j.respol.2019.04.014 [DOI] [Google Scholar]
  • 47. Akcigit U, Caicedo S, Miguelez E, Stantcheva S, Sterzi V. Dancing with the stars: Innovation through interactions. National Bureau of Economic Research; 2018. [Google Scholar]
  • 48. Gandal N, Kunievsky N, Branstetter L. Network-Mediated Knowledge Spillovers in ICT/Information Security. Review of Network Economics. 2020;19(2):85–114. doi: 10.1515/rne-2020-0034 [DOI] [Google Scholar]
  • 49. Forman C, Zeebroeck Nv. From wires to partners: How the Internet has fostered R&D collaborations within firms. Management science. 2012;58(8):1549–1568. doi: 10.1287/mnsc.1110.1505 [DOI] [Google Scholar]
  • 50. Akcigit U, Stantcheva S. Taxation and Innovation: What Do We Know? National Bureau of Economic Research; 2020. [Google Scholar]
  • 51.Greene WH. Econometric Analysis. Pearson; 2018. Available from: https://books.google.ch/books?id=xGZRvgAACAAJ.
  • 52. Mulaik SA. Foundations of factor analysis. CRC press; 2009. [Google Scholar]
  • 53. Kaiser HF. The application of electronic computers to factor analysis. Educational and psychological measurement. 1960;20(1):141–151. doi: 10.1177/001316446002000116 [DOI] [Google Scholar]
  • 54. Hotelling H. The relations of the newer multivariate statistical methods to factor analysis. British Journal of Statistical Psychology. 1957;10(2):69–79. doi: 10.1111/j.2044-8317.1957.tb00179.x [DOI] [Google Scholar]
  • 55. Stock JH, Watson MW. Forecasting using principal components from a large number of predictors. Journal of the American statistical association. 2002;97(460):1167–1179. doi: 10.1198/016214502388618960 [DOI] [Google Scholar]
  • 56. Cheng S, Long JS. Testing for IIA in the multinomial logit model. Sociological methods & research. 2007;35(4):583–600. doi: 10.1177/0049124106292361 [DOI] [Google Scholar]
  • 57. Van Zeebroeck N, de la Potterie BvP, Guellec D. Claiming more: the increased voluminosity of patent applications and its determinants. Research Policy. 2009;38(6):1006–1020. doi: 10.1016/j.respol.2009.02.004 [DOI] [Google Scholar]
  • 58. Arora A, Ceccagnoli M, Cohen WM. R&D and the patent premium. International journal of industrial organization. 2008;26(5):1153–1179. doi: 10.1016/j.ijindorg.2007.11.004 [DOI] [Google Scholar]
  • 59. Swann GP. The economics of innovation: an introduction. Edward Elgar Publishing; 2014. [Google Scholar]
  • 60. Cohen WM, Goto A, Nagata A, Nelson RR, Walsh JP. RandD spillovers, patents and the incentives to innovate in Japan and the United States. Research Policy. 2002. doi: 10.1016/S0048-7333(02)00068-9 [DOI] [Google Scholar]
  • 61. Bloom N, Jones CI, Van Reenen J, Webb M. Are ideas getting harder to find? American Economic Review. 2020;110(4):1104–44. doi: 10.1257/aer.20180338 [DOI] [Google Scholar]

Decision Letter 0

Wonjoon Kim

4 Aug 2021

PONE-D-21-18643

Identifying and characterising product and process inventions in patent data

PLOS ONE

Dear Dr. Seliger,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

The review process for your manuscript entitled "Identifying and characterizing product and process inventions in patent data," which you submitted to PLOS ONE, is now complete. I have received comments from two reviewers. The review team thoroughly reviewed the manuscript and both reviewers recommend major revision. Please read carefully the comments and submit the revised manuscript.

Please submit your revised manuscript by Sep 18 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Wonjoon Kim, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. 

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

3. Thank you for stating the following in the Acknowledgments Section of your manuscript: 

We are grateful to the European Academic Research Programme for the financial support of this project.

We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. 

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: 

We received funding from the European Patent Office Academic Research Programme: https://www.epo.org/learning/materials/academic-research-programme.html

Part of Sebastian Heinrich's salary was funded by this grant.

NO - The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Additionally, because some of your funding information pertains to [commercial funding//patents], we ask you to provide an updated Competing Interests statement, declaring all sources of commercial funding. 

In your Competing Interests statement, please confirm that your commercial funding does not alter your adherence to PLOS ONE Editorial policies and criteria by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” as detailed online in our guide for authors  http://journals.plos.org/plosone/s/competing-interests.  If this statement is not true and your adherence to PLOS policies on sharing data and materials is altered, please explain how. 

Please include the updated Competing Interests Statement and Funding Statement in your cover letter. We will change the online submission form on your behalf.

4. We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere. [http://documents.epo.org/projects/babylon/eponet.nsf/0/A69432F980D71284C12584F5003DE05C/$File/ARP_report_Woerter_en.pdf] Please clarify whether this publication was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: No

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The paper uses accessible patent data to classify patents into process patents, product patents, and mixed patents, which is an important study to promote the understanding of process patents, which are of particular interest in patent analysis. I will be happy to recommend the publication of the paper after the authors address a few points, some of which are quite important in my opinion.

My main concern is that the statistical validity of the method used to identify process patents, product patents and mixed patents, which is one of the core contributions of the study, is not ensured. The authors say that "In contrast, the identification of processes is quite complete if using the extracted keywords". However, I am not sure why only text information can be used to classify process patents when other information is available such as article citations and IPC. Moreover, the study uses keywords based on the patent's expert as queries in order to classify each patent, but there is no statistical explanation as to whether these keywords are appropriate enough to obtain a comprehensive list of process patents. The paper only obtains patents where the set keywords appear at the beginning of the sentence, and there must be sufficient evidence to show that these are equal to process patents, product patents or mixed patents. It would be good to compare several methods to show that the proposed method is a better one.

Also, the analysis of the word window used to classify each claim described in Supporting Information 1 needs to be tested to ensure the validity of the method, e.g. by checking how many words are identified as process patents (e.g. 1-10 words) rather than visually comparing 2-words and 5-words criterion.

With these experiments, it can be said that the characteristics of each patent have been clarified, as mentioned in the Conclusion.

And why not do the same analysis for process patents and mixed patents? Although this study compares process patents and mixed patents using product patents as a base, the comparison of the two relatively new forms of patents(mixed and process) will enable a more detailed description of the characteristics of each of the three patents.

Minor

In some figures, authors could make it more legible. For example, in Fig.1, what does the name of y-axis "Count in 1000" mean? Comparing the EPO and the USPTO, the USPTO has a larger scale on the y-axis, but where does this difference come from? To illustrate the percentage of each patent as mentioned in page 5, it would be easier to read if stacked bar graphs showing the percentage of each patent was added behind the plot.

In page 8 line 246, the word "nnlp_cits" must be "lnnpl_cits".

In Table1. process keywords and use keywords should be written separately.

Reviewer #2: This study focuses on identifying the product and process inventions and characterizing them using patent data. The results provide some meaningful technological trend and practical implications. However, there are several points needed to be addressed for more complete one.

1 This study lacks an academic contribution. The theme is quite technical so that the research motivation is more or less unclear. It means this study has a weak connection to theoretical literature. Simply identifying product and process patents and linking them to the fundamental characters of patents are not enough to publish in the academic journal. In that sense why this study is necessary has to be firmly addressed in the theoretical context: economic theory, technology management, innovation theory, and so on. Also, the authors need to shed light on the novelty of this study and its results compared to the previous studies and the relevant literature.

2 Similarly, in order to overcome the limited value of this study, interpreting the results should be extended to the theoretical literature. How they are meaningful in the technology management, innovation, industry trend, and so on should be addressed.

3 The conclusion part is too short to enrich its academic value. The present conclusion seems a repetition of the findings rather than a discussion on the main findings, their limitations, and further studies. Inclusion of discussion part is strongly recommended.

4. The present title seems to be quite neutral. Title modification is also recommended, which reflects its academic contribution.

5 There is no numbering of subtitles. It may lead the readers to a little confusion to follow the paper. It should be more systematic.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Aug 15;17(8):e0272225. doi: 10.1371/journal.pone.0272225.r002

Author response to Decision Letter 0


26 Jan 2022

Editors’ comments:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Answer: We have adjusted the manuscript and naming of files according to the style templates.

2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match.

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

Answer: There was no specific grant number for this funding program. If you need further information, let us kindly know it.

3. Thank you for stating the following in the Acknowledgments Section of your manuscript:

We are grateful to the European Academic Research Programme for the financial support of this project.

We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript …

Answer: We have removed the Acknowledgements.

…and let us know how you would like to update your Funding Statement.

Currently, your Funding Statement reads as follows:

We received funding from the European Patent Office Academic Research Programme: https://www.epo.org/learning/materials/academic-research-programme.html

Part of Sebastian Heinrich's salary was funded by this grant.

NO - The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Answer: Please update the Funding Statement as follows:

We are grateful to the European Academic Research Programme for the financial support of this project.

Part of Sebastian Heinrich's salary was funded by this grant.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Additionally, because some of your funding information pertains to [commercial funding//patents], we ask you to provide an updated Competing Interests statement, declaring all sources of commercial funding.

In your Competing Interests statement, please confirm that your commercial funding does not alter your adherence to PLOS ONE Editorial policies and criteria by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests. If this statement is not true and your adherence to PLOS policies on sharing data and materials is altered, please explain how.

Please include the updated Competing Interests Statement and Funding Statement in your cover letter. We will change the online submission form on your behalf.

Answer: We’ve included the statement in the cover letter.

4. We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere. [http://documents.epo.org/projects/babylon/eponet.nsf/0/A69432F980D71284C12584F5003DE05C/$File/ARP_report_Woerter_en.pdf] Please clarify whether this publication was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript.

Answer: As mentioned in the text, the project was conducted for the EPO Academic Research Programme which also provided funding. The project report contains a lot of detailed information on which we also needed to draw in this publication. This publication has a different topic than the very comprehensive report as we characterize product and process patens with patent indicators etc. However, it is not possible to avoid some overlap when it comes to describing the classification process and the database, which has been cited accordingly. The project report is not a peer-reviewed paper. It is solely a project report that has been published on the EPO’s website.

Reviewer #1:

The paper uses accessible patent data to classify patents into process patents, product patents, and mixed patents, which is an important study to promote the understanding of process patents, which are of particular interest in patent analysis. I will be happy to recommend the publication of the paper after the authors address a few points, some of which are quite important in my opinion.

Answer: Dear reviewer,

Thank you for your thorough reading of the manuscript and your suggestions to improve it.

My main concern is that the statistical validity of the method used to identify process patents, product patents and mixed patents, which is one of the core contributions of the study, is not ensured. The authors say that "In contrast, the identification of processes is quite complete if using the extracted keywords". However, I am not sure why only text information can be used to classify process patents when other information is available such as article citations and IPC.

Answer: Concerning the usage of other information such as citations or IPC, it appears to us that citations or IPCs do not inherently contain information about the difference between products and processes.

We were indeed considering to use the IPC in order to classify the patents in the beginning of the project. However, classification (IPC, CPC or whatever system is used) is a means to index the content and to make it more accessible for the examiner when searching for prior art, but it doesn’t provide consistent information to classify patents into product and process inventions. We have been pointed to this issue by patent experts (e.g. from the Fraunhofer Institute for Systems and Innovation Research ISI) and were advised against using IPC.

Nevertheless, we checked a sample of patents and came to the same conclusion, namely that it is not possible to classify patents unambiguously based on the IPC. In the IPC, there are classes or subclasses whose names seem to indicate that they must solely contain either product or process patents, respectively. But this is usually not the case and you will both find product patents in IPC sections that seem to only include processes, and conversely.

Claims, in turn, express the scope of protection sought (patent application) or actually obtained (granted patent). We find this kind of information to be more useful for our classification purposes, especially because the EPO and USPTO also have detailed examination guidelines where they define product and process claims in detail. We also want to point out that by using full-text data we can utilize much more and richer information than by relying on predefined metrics.

Finally, backward citations are the “prior art”, and mostly also have classification codes (patents for sure/NPL sometimes). We did not consider using citation data etc. for the same reasons. If we used citations, we would have needed to know if the cited patent is a process patent. But we do not know this ex ante, which is the reason why we propose a text-based classification for all patents.

Moreover, the study uses keywords based on the patent's expert as queries in order to classify each patent, but there is no statistical explanation as to whether these keywords are appropriate enough to obtain a comprehensive list of process patents.

Answer: The keywords are appropriate with regard to content in the sense that it is possible to find almost all process patents according to the examination guidelines. In addition, research assistants classified the claims from 1100 patents manually and their results were pretty close to the results from keyword search. We also manually inspected about 1000 patent texts in the aftermath and came to the same conclusion.

The paper only obtains patents where the set keywords appear at the beginning of the sentence, and there must be sufficient evidence to show that these are equal to process patents, product patents or mixed patents. It would be good to compare several methods to show that the proposed method is a better one.

Answer: We actually compared the keyword-based classification with a classification from a machine learning approach. The results are largely similar in their classification outcome, but by extensive manual inspection, we found that the keyword approach captures the class boundaries better. We’ve added a paragraph briefly describing the machine learning approach and the reason for focusing on the keyword approach.

Also the analysis of the word window used to classify each claim described in Supporting Information 1 needs to be tested to ensure the validity of the method, e.g. by checking how many words are identified as process patents (e.g. 1-10 words) rather than visually comparing 2-words and 5-words criterion.

With these experiments, it can be said that the characteristics of each patent have been clarified, as mentioned in the Conclusion.

Answer: With the visual comparison, we wanted to show that it doesn’t make much difference whether we use the first 2 or first 5 words. The criteria were chosen after inspecting thousands of claim texts manually and also the resulting classification were checked against thousands of claim texts (see also explanations above).

From manual inspection, it has become clear that we need to limit the classification to the first 2 (minimum) or first 5 words (maximum) of a claim text (we could also have tried 3 or 4, but we stuck to the boundaries).Please also see the detailed examples in S1.

We computed the accuracy based on the training dataset that has been labeled for the machine learning approach (see above). The accuracy was very high for the keyword search based on either the first 2 or 5 words (98). Therefore, we are convinced that the patent claims were classified correctly.

And why not do the same analysis for process patents and mixed patents? Although this study compares process patents and mixed patents using product patents as a base, the comparison of the two relatively new forms of patents(mixed and process) will enable a more detailed description of the characteristics of each of the three patents.

Answer:Many thanks for this advice. We now present additional tables with process patents as the base category to compare mixed with process patents directly. The advantage of presenting the result in this way is that it is easier to see the significant differences between process and mixed patents. Note that we still discuss the results with respect to the base category product patents in order to keep the Results section readable.

Minor

In some figures, authors could make it more legible. For example, in Fig.1, what does the name of y-axis "Count in 1000" mean? Comparing the EPO and the USPTO, the USPTO has a larger scale on the y-axis, but where does this difference come from? To illustrate the percentage of each patent as mentioned in page 5, it would be easier to read if stacked bar graphs showing the percentage of each patent was added behind the plot.

Answer: Fig. 1 to 4 depict simple patent counts and the y-axis expresses counts of granted patents in 1000. We have adjusted the axes labels for more clarity. The difference in scale comes from the discrepancy in patent volume between the two offices. We decided not to rescale the EPO plots’ axes, as the time series would have been more compressed and therefore less able to deliver the required information.

Concerning stacked bar graphs, we think that time trends can be better depicted in line charts, especially the relative changes between the different categories. We think that the line plots make the comparison of differences in time trends more easily understandable, while we still plot the total patent counts. If this request is of high importance to you, we can of course provide the figures in the suggested format.

In page 8 line 246, the word "nnlp_cits" must be "lnnpl_cits".

In Table1. process keywords and use keywords should be written separately.

Answer: We have corrected the error and changed the table according to your suggestion. Thank you.

Reviewer #2:

This study focuses on identifying the product and process inventions and characterizing them using patent data. The results provide some meaningful technological trend and practical implications. However, there are several points needed to be addressed for more complete one.

1 This study lacks an academic contribution. The theme is quite technical so that the research motivation is more or less unclear. It means this study has a weak connection to theoretical literature. Simply identifying product and process patents and linking them to the fundamental characters of patents are not enough to publish in the academic journal. In that sense why this study is necessary has to be firmly addressed in the theoretical context: economic theory, technology management, innovation theory, and so on. Also, the authors need to shed light on the novelty of this study and its results compared to the previous studies and the relevant literature.

Answer: Thanks a lot for your careful reading of the paper and your suggestions to improve it. We absolutely agree that the embedding in the literature should be expanded so that the contribution of the study can be better assessed beyond the pure classification of the patents. We had paid too little attention to this. In the new version of the manuscript, we have significantly expanded this part. In the new manuscript, we show in the introduction in which contexts the classification of patents can make an important contribution to theoretical and empirical literature without distracting the reader from the main contribution of the study.

2 Similarly, in order to overcome the limited value of this study, interpreting the results should be extended to the theoretical literature. How they are meaningful in the technology management, innovation, industry trend, and so on should be addressed.

Answer: We now discuss the results against the background of the existing literature and emphasize the contribution of this study in the Conclusions section.

3 The conclusion part is too short to enrich its academic value. The present conclusion seems a repetition of the findings rather than a discussion on the main findings, their limitations, and further studies. Inclusion of discussion part is strongly recommended.

Answer: We have extended the conclusions according to your suggestion and, most importantly, added a discussion of the results. We also present limitations and suggestions for further studies.

Note that the Results section also discusses results to some extent.

4. The present title seems to be quite neutral. Title modification is also recommended, which reflects its academic contribution.

Answer: We’ve changed the title to ‘Appropriability and basicness of R&D – identifying and characterizing product and process inventions in patent data’, We hope that you share our view that it reflects the contribution better.

5 There is no numbering of subtitles. It may lead the readers to a little confusion to follow the paper. It should be more systematic.

Answer: We have used the PlosOne Latex Template and followed all guidelines there. Articles in PlosOne don’t have a numbering of subtitles. Unfortunately, there is nothing we can do here.

Attachment

Submitted filename: Rebuttal letter.docx

Decision Letter 1

George Vousden

15 Jul 2022

Appropriability and basicness of R&D: Identifying and characterising product and process inventions in patent data

PONE-D-21-18643R1

Dear Dr. Seliger,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

George Vousden

Staff Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: This revision is much improved than the previous one in providing its theoretical meaning. And also the title has been changed and matched well with it. Now it seems more proper for the academic journal. For more complete paper I’d like to suggest that the abstract should be changed by adding the theoretical contribution.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

**********

Acceptance letter

George Vousden

5 Aug 2022

PONE-D-21-18643R1

Appropriability and basicness of R&D: Identifying and characterising product and process inventions in patent data

Dear Dr. Seliger:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. George Vousden

Staff Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Appendix.

    This appendix describes the implementation of the keyword search, the identification of independent and product-by-process claims, and the construction of a measure for growing technologies.

    (PDF)

    S1 Table. Regression results with different basis.

    These tables contain the results from the multinomial logit model with different base categories.

    (PDF)

    Attachment

    Submitted filename: Rebuttal letter.docx

    Data Availability Statement

    The data are available to the public on Harvard Dataverse (DOI: https://doi.org/10.7910/DVN/CBSK2W).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES