Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2023 Feb 14;13:2586. doi: 10.1038/s41598-023-28766-y

Interpretable patent recommendation with knowledge graph and deep learning

Han Chen 1, Weiwei Deng 2,
PMCID: PMC9929065  PMID: 36788243

Abstract

Patent transfer is a common practice for companies to obtain competitive advantages. However, they encounter the difficulty of selecting suitable patents because the number of patents is increasingly large. Many patent recommendation methods have been proposed to ease the difficulty, but they ignore patent quality and cannot explain why certain patents are recommended. Patent quality and recommendation explanations affect companies’ decision-making in the patent transfer context. Failing to consider them in the recommendation process leads to less effective recommendation results. To fill these gaps, this paper proposes an interpretable patent recommendation method based on knowledge graph and deep learning. The proposed method organizes heterogeneous patent information as a knowledge graph. Then it extracts connectivity and quality features from the knowledge graph for pairs of patents and companies. The former features indicate the relevance of the pairs while the latter features reflect the quality of the patents. Based on the features, we design an interpretable recommendation model by combining a deep neural network with a relevance propagation technique. We conduct experiments with real-world data to evaluate the proposed method. Recommendation lists with varying lengths show that the average precision, recall, and mean average precision of the proposed method are 0.596, 0.636, and 0.584, which improve corresponding performance of best baselines by 7.28%, 18.35%, and 8.60%, respectively. Besides, our method interprets recommendation results by identifying important features leading to the results.

Subject terms: Information technology, Scientific data

Introduction

Patent transfer refers to the movement of patent rights from one party to another. With the advent of open innovation, patent transfer has become a common practice for companies to obtain competitive advantages. Both patent owners and demanders have incentives to transfer patents. For the former, patent transfer provides monetary benefits (e.g., royalty payment) and strategic benefits (e.g., strengthening companies’ product market and technological positions)1. For the latter, patent transfer reduces the risks and costs associated with technological innovations, provides industry standards and advanced technologies, and facilitates technological learning2. Previous research has shown that patent transfer has significant impacts not only on companies’ performance but also on national economies2,3. For example, Han and Lee4 demonstrated that patent transfer significantly increases the market value of firms in Korea. Kwon5 found that firms’ patent transfer makes their rivals deter innovative activities. Roessner et al.6 combined university licensing data of the United States (US) from 1996 to 2010 with economic input–output models and found that the contribution of university licensing to the US economy during that period was at least 162.1 billion dollars.

Due to the importance of patent transfer, online patent platforms have been developed to facilitate the transfer process7,8. However, patent demanders still face an information overload problem as the number of patents is getting large. The number of yearly granted patents has exceeded 0.5 million since 2005 and reached 1.5 million in 2019 as recorded by the World Intellectual Property Organization (https://www.wipo.int/publications/en/details.jsp?id=4526&plang = EN). Consequently, the cost of searching for suitable patents is high. Many patent recommendation methods have been developed to suggest suitable patents to users913. However, previous methods fail to consider patent quality and explain recommendation results. The quality of patents reflects their value and is essential to firms. Previous studies show that patent quality is positively correlated to firms’ stock returns14, high-quality patents can protect firms from patent trolls and product infringement15, and the uncertainty of patent quality may cost enterprises billions16. Interpretability of the recommendation results also matters. Patents have complex information and cost companies much to select a suitable one. The interpretability increases information transparency and thus promotes persuasiveness and acceptance of recommender systems17. Besides, interpretable recommendations benefit users and platforms because of the improved user-friendliness18. Therefore, providing interpretable recommendations is necessary to improve companies’ decision-making in selecting suitable patents.

To bridge these gaps, this research proposes an interpretable patent recommendation method to facilitate companies to select suitable patents. The proposed method organizes heterogeneous patent information as a knowledge graph, a graph-structured knowledge base that enables efficient integration and semantic interpretation of heterogeneous information19. It then extracts two types of features from the knowledge graph, i.e., connectivity features and quality features that indicate the relevance of company-patent pairs and patent quality. Last, the proposed method builds an interpretable deep learning model for recommending patents and providing explanations to companies. Deep learning is popular in personalized recommendations for its superior performance, but it lacks interpretability. Therefore, this research develops an interpretable deep learning model by combining a deep neural network (DNN) with a layer-wise relevance propagation technique (LRP)20. DNN predicts the probabilities that target companies will select candidate patents while LRP generates interpretations by identifying the features that contribute the most to the predictions. The proposed method is evaluated with data obtained from the United States Patent and Trademark Office (USPTO, https://www.uspto.gov/learning-and-resources/electronic-data-products/patent-assignment-dataset) and PatentsView (http://www.patentsview.org/download/) databases. The evaluation results show that the proposed method not only outperforms the state-of-the-art patent recommendation methods in terms of the precision, recall, and mean average precision (MAP) measures but also provides valid interpretations.

The rest of this paper is organized as follows. Section "Related work" reviews studies on patent recommendations and patent quality analysis. Section "The proposed patent recommendation method" presents the proposed knowledge graph-based method for interpretable patent recommendations. Section "Experimental evaluation" introduces the details of the experimental evaluation. Section "Results and discussions" presents and discusses the evaluation results. Section "Conclusions" concludes this research with major contributions and possible future works.

Related work

Patent recommendations

Recommendation methods have been widely used to solve information overload problems in different contexts by proactively delivering personalized recommendations21. In the patent domain, recommendation methods have also been proposed to help users to find suitable patents. Previous patent recommendation methods differ mainly in application contexts, the information involved in the recommendations, and the techniques applied for the recommendations. The application contexts mainly contain query-driven patent search and patent transfer contexts8. The former provides specific user needs like keywords or patent documents while the latter requires the mining of company needs9,22. Patents have heterogeneous information that can be used for recommendation, including texts (e.g., patent titles and abstracts), categories (e.g., the International Patent Classification), interactions (e.g., patent searching and assignment behavior of users), citations, and inventors of patents23. Besides, the applied techniques of patent recommendations can be divided into content-based filtering (CB), collaborative filtering (CF), and hybrid methods12,24. Table 1 summarizes major patent recommendation studies based on the above aspects.

Table 1.

Major studies on patent recommendations.

Studies Contexts Involved information Applied techniques Interpretability
Texts Categories Citations Inventors Interactions Quality
Chen and Chiu 25 C1 CB
Ji et al.26 C1 Hybrid
Sooyoung Oh et al.27 C1 Hybrid
Trappey et al.28 C1 CF
Krestel and Smyth29 C1 CB
Oh et al.30 C1 CF
Mahdabi and Crestani31 C1 Hybrid
Fu et al.23 C1 Hybrid
Deng et al.32 C1 CB
Rui and Min33 C1 CB
Chen et al.9 C1 CB
Trappey et al.12 C1 CB
Wang et al.8 C2 Hybrid
He et al.10 C2 Hybrid
Deng and Ma (2021) C2 CB
Du et al.22 C2 Hybrid
The proposed method C2 Hybrid

C1 is the query-driven patent search context, C2 is the patent transfer context, CB represents content-based recommendation methods, and CF represents collaborative filtering recommendation methods.

Several conclusions and research gaps can be identified from Table 1. First, previous studies mainly focus on the query-driven patent search context and pay less attention to the patent transfer context. In the former context, users provide explicit needs and care little about patent quality. For example, patent inventors provide key terms to search relevant prior arts before applying patents28. Patent examiners use patent applications to search missing references and examine the novelty of the applications23. Patent users identify similar patents for a given target patent9. Only several methods have been proposed for the patent transfer context and most of them were proposed recently. This situation indicates the increasing importance of patent transfer and calls for greater attention to improving patent transfer.

Second, previous studies have used various patent information but ignored patent quality. As shown in Table 1, previous studies on patent recommendations have different types of patent information, including texts, categories, citations, inventors, and interactions. However, they failed to consider patent quality, which is important to companies’ value and competitiveness14,15. Ignoring the quality of acquired patents can impede business and result in costly lawsuits16. Therefore, incorporating patent quality is imperative to recommending suitable patents to companies.

Third, CB and hybrid methods gain more popularity than CF methods in patent recommendation. CF methods recommend patents based on like-minded users. For example, Trappey et al.28 used a user-based CF method to identify users with similar patent searching behaviors and recommend patents that are liked by similar users. CF methods are less suitable in the technology transfer context because most patents are transferred only once to only one company. Consequently, CF methods suffer from the data sparsity problem in the technology transfer context. CB methods recommend patents that are similar to the patents liked by target users. For example, Krestel and Smyth29 used a latent Dirichlet allocation model to recommend patents with topics similar to a query patent. Chen and Chiu25 developed an IPC-based vector space model to find similar patents for a given patent. CB methods are suitable for the query-driven context because it mainly requires the matching between patents and queries. However, CB methods have difficulty in involving heterogeneous information and face an over-specialization problem, i.e., they recommend only patents with features previously liked by target users. Hybrid methods combine different methods to overcome their disadvantages. One frequently used hybrid mechanism is to learn a unified model based on the collaborative and content information. For example, Mahdabi and Crestani31 developed a citation query model which combines information including patent text, citations, categories, and inventors. Hybrid methods are popular in patent recommendation because of their advantages in overcoming the drawbacks of CF and CB methods. Consequently, this study proposes a hybrid method that incorporates heterogeneous patent information for patent recommendation.

Last, previous studies ignore the interpretability of patent recommendations. Given the importance of interpretability to companies’ decision-making17,18, there is a need for endowing recommendation results with interpretability. Therefore, this study develops an interpretable patent recommendation method, which can identify the features that contribute the most to each recommendation. Based on the identified features, we can explain why a specific patent is recommended to a specific company.

Patent quality analysis

Patent quality analysis focuses on identifying potential indicators of patent quality. Multiple indicators of patent quality have been proposed34. Among the proposed indicators, we summarize eight frequently used indicators, i.e., the number of forward citations, the number of backward citations, the number of patent claims, patent scope, previous transfer, patent family size, patent generality, and patent originality.

Forward citations are the citations that patents receive. A higher number of forward citations indicates a greater value of current inventions and their importance for subsequent technologies35,36. A patent’s backward citations are the references the patent has. More references indicate the more developed a technological field is and more incrementally the patent contributes to the field. Previous studies36,37 have consistently shown that there is a positive correlation between the number of backward citations and patent quality. A patent’s claims delineate the property rights protected by the patent and a larger number of patent claims indicates higher patent quality38. Patent scope is defined as the number of distinct patent classes to which patents are allocated. The broader the patent scope, the more attractive the patents’ exclusive rights, and hence the higher quality of the patents38. Previous transfer refers to the number of times a patent has been previously traded. It indicates the patent’s value as recognized by the technological market39. Patent family size is the number of countries in which a patented invention is protected. A larger patent family size indicates a higher quality of patents because the patented technologies are protected in a broader geographical scope40. Patent generality measures the generality of a patent’s impact. A patent has a general impact if it is cited by other patents that belong to a wide range of technological fields40,41. Patent originality measures the originality of a patent. A patent is believed to be more original if it relies on diverse knowledge sources (i.e., its backward citations belong to a wide range of technological fields)40,41. According to previous research, this study uses these eight indicators to measure patent quality.

The proposed patent recommendation method

Overview of the proposed method

Figure 1 shows the framework of the proposed patent recommendation method. In the proposed method, we organize heterogeneous patent information as a patent knowledge graph that stores the information in the form of semantic relations between entities. The proposed method comprises three modules, namely, knowledge graph construction, feature extraction, and patent recommendation. The first module extracts patent and company information to construct a knowledge graph. The second module extracts, from the knowledge graph, connectivity and quality features that indicate the relevance between companies and patents and the quality of patents, respectively. Given the extracted features, the third module develops an interpretable patent recommendation model to recommend suitable patents and to provide interpretations to target companies. The following subsections present the details of these three modules.

Figure 1.

Figure 1

Overview of the proposed patent recommendation method.

Knowledge graph construction

Concepts and definitions

Definition 1 (Knowledge Graph). A knowledge graph is a graph-structured knowledge base denoted as G=h,r,tE×R×E, where E is a set of entities, R is a set of relations, h,tE, and rR. A triple h,r,t represents the fact that there is a relation r between entities h and t. For example, patenti,hasterm,termj indicates the fact that patenti contains termj.

Definition 2 (Meta Path). A meta path is a type of composite relations between two types of entities. It is denoted as P=H,r1,E1,r2,,rl,T, where H and T are the types of head and tail entities respectively, rlR is a relation, E1 is the type of entities reached through r1, and l denotes the length of the meta path. Meta paths capture the semantic connections between entities and can be used to measure relevance between entities. For example, (Patent, has_term, Term, has_term, Patent, is_transfered_to, Company) is a meta path between patents and companies. The meta path indicates that the head patents are relevant to the companies because the head patents have the same terms as the patents transferred to the companies. A meta path can contain multiple concrete paths. For example, (patent1, has_term, term1, has_term, patent2, is_transfered_to, company1) and (patent1, has_term, term2, has_term, patent2, is_transfered_to, company1) are two concrete paths that belong to the above meta path.

Knowledge graph

A domain-specific knowledge graph is constructed for patent recommendation in the current context. The types of facts in the knowledge graph are shown in Fig. 2. Specifically, the knowledge graph comprises seven types of entities (i.e., patents, companies, inventors, terms, categories, countries, and claims) and eight types of relations (i.e., patenti cites patentj, patenti is invented by inventorj, patenti is transferred to companyj, patenti has term termj, patenti has category categoryj, patenti is protected in countryj, and patenti has claim claimj).

Figure 2.

Figure 2

The graph schema of the knowledge graph.

The knowledge graph is constructed by extracting the above types of entities and relations from patent databases. For example, we can extract, from Google’s patent database, two patents “5,208,847” and “4,893,327”, a company “Rockstar Bidco”, an inventor “Daniel L. Allen”, two terms “cellular” and “network”, two categories “H04W16/00” and “H04W8/26”, a country “the US”, and a claim “A method for providing an overlay cellular network”. When extracting terms from patents, text-processing techniques developed by Stanford42 are used to converse cases, remove stop words, standardize derivate words, etc. We can further extract relations between these entities, i.e., “5,208,847” cites “4,893,327”, “5,208,847” is transferred to “Rockstar Bidco”, “5,208,847” is invented by “Daniel L. Allen”, “5,208,847” has terms “cellular” and “network”, “5,208,847” has categories “H04W16/00” and “H04W8/26”, “5,208,847” is protected in “the US”, and “5,208,847” has a claim “A method for providing an overlay cellular network”. The knowledge graph is an accumulative knowledge base and can include more facts to be extracted.

Patent data except terms are structured and can be imported from public databases (e.g., the USPTO and PatentsView databases) to the knowledge graph. For patent terms, this study uses the rapid automatic keyword extraction (RAKE) method43 to extract three representative terms from the titles and abstracts of patents. RAKE is an unsupervised and domain-independent method for term extraction. It has higher accuracy and efficiency than other term extraction methods and thus is selected in this study.

Feature extraction

Connectivity features

Connectivity features are the connections between patents and companies and indicate their relevance based on various connecting entities. For example, a path (patent1, cites, patent2, is_transfered_to, company1) indicates that patent1 is relevant to company1 because it cites a patent transferred to the company. We define meta paths that connect companies to patents as the connectivity features. Meta paths involving claims are not considered because patents have unique claims, meaning that these meta paths have no concrete paths connecting any patent to any company. Besides, this study considers only meta paths within a certain length (e.g., 3) because long meta paths provide trivial information while causing high computational costs44. Therefore, from the constructed knowledge graph, this study extracts six connectivity features, which are presented along with their descriptions in Table 2.

Table 2.

Connectivity features and their descriptions.

No Connectivity features Descriptions
P1 PatenthastermTermhastermPatentistransferredtoCompany The head patents have the same terms as the patents that were transferred to the companies
P2 PatenthascategoryCategoryhascategoryPatentistransferredtoCompany The head patents have the same categories as the patents that were transferred to the companies
P3 PatentcitesPatentistransferredtoCompany The head patents have citation relation with the patents that were transferred to the companies
P4 PatentcitesPatentcitesPatentistransferredtoCompany The head patents have indirect citation relation with the patents that were transferred to the companies
P5 PatentisinventedbyInventorisinventedbyPatentistransferredtoCompany The head patents share the same inventors as the patents that were transferred to the companies
P6 PatentistransferredtoCompanyistransferredtoPatentistransferredtoCompany The head patents share the same assignees with the patents that were transferred to the tail companies
P7 PatentisprotectedinCountryisprotectedinPatentistransferredtoCompany The head patents are protected in the same countries as the patents that were transferred to the tail companies

The connectivity features need to be quantified before being used for patent recommendations. Given a patent-company pair patenti,companyj, let #pathijPk be the number of concrete paths that connect patenti to companyj and belong to connectivity feature Pk. Then, this study quantifies the feature between patenti and companyj as follows:

xijPk=#pathijPk-minrPatj#pathrjPkmaxrPat#pathrjPk-minrPat#pathrjPk 1

where Patj is a set of candidate patents for companyj, maxrPatj#pathrjPk and minrPatj#pathrjPk are the maximum and minimum numbers of concrete paths that belong to Pk and connect any patentr in Pat to companyj, and xijPk equals 0 if maxrPat#pathrjPk=minrPatj#pathrjPk. xijPk reflects the connection strength between patenti and companyj through Pk considering all candidate patents that can connect companyj via Pk.

Quality features

The quality features indicate the quality of patents. As discussed in Section "Patent quality analysis", eight quality indicators are used to quantify patent quality and are mathematically defined as follows.

First, the number of forward citations (NFC). It suffers from the anti-recency problem that older patents have longer periods to accumulate forward citations than newer patents. To address the problem, we consider the publication time of patents and define NFC as follows:

NFCpatenti=j=1Patent_set#patentj,cites,patentiy, 2

where Patent_set is the set of patents in the knowledge graph, #patentj,cites,patenti equals 1 if patentj,cites,patenti exists in the knowledge graph and 0 otherwise, j=1Patent#patentj,cites,patenti counts the number of patents that cite patenti, and y is the number of years since the publication of patenti.

Second, the number of backward citations (NBC). It is defined as follows:

NBCpatenti=j=1Patent_set#patenti,cites,patentj, 3

where j=1Patent_set#patenti,cites,patentj counts the number of patents cited by patenti.

Third, the number of claims (NCL). It is defined as follows:

NCLpatenti=j=1Claim_set#patenti,has_claim,claimj, 4

where Claim_set is the set of claims in the knowledge graph and j=1Claim_set#patenti,has_claim,claimj counts the number of claims contained by patenti.

Fourth, patent scope (PS). It is defined as follows:

PSpatenti=j=1Category_set#patenti,has_category,categoryj, 5

where Category_set is the set of categories in the knowledge graph and j=1Category_set#patenti,has_category,categoryj counts the number of categories owned by patenti.

Fifth, previous transfers (PT). It is defined as follows:

PTpatenti=j=1Company_set#patenti,is_transferred_to,companyj, 6

where Company_set is the set of companies in the knowledge graph and j=1Company_set#patenti,is_transferred_to,companyj counts the number of companies to which patenti has been transferred.

Sixth, patent family size (PFS). It is defined as follows:

PFSpatenti=j=1Country_set#patenti,is_protected_in,countryj, 7

where Country_set is the set of countries in the knowledge graph and j=1Country_set#patenti,is_protected_in,countryj counts the number of companies in which patenti is protected.

Seventh, patent generality (PG). Eighth, patent originality (PO). These two indicators are defined as follows:

PGpatenti=category|#patentm,hascategory,category=1,patentmPatent_seti,fPatent_seti,f, 8
POpatenti=category|#patentm,hascategory,category=1,patentmPatent_seti,bPatent_seti,b, 9

where Patent_seti,f (Patent_seti,b) denotes the set of forward (backward) citations of patenti, category|#patentm,hascategory,category=1,patentmPatent_seti,f is the set of unique categories to which these forward citations belong, and set is the number of items in the set. PGpatenti (POpatenti) counts the average number of unique categories to which these forward (backward) citations belong.

Patent recommendation

In the patent recommendation module, we design an interpretable recommendation model based on DNN and LRP techniques. Given a set of patent-company pairs, the study transmits their connectivity and quality features to a basic DNN, a feed-forward neural network with multiple hidden layers. The DNN maps the features non-linearly from its input layer to a sequence of hidden layers and, at last, its output layer, which produces the probability that a specific company will select a specific patent. Let a0 be the input vector of the input layer of an L-layer DNN and al (l=1,2,3,,L) be the output vector of its l-th layer (the output of the (l-1)-th layer is the input of the l-th layer). Then, the DNN is defined as follows:

al=ϕlωlal-1+bl, 10

where ϕl represents an activation function that performs a nonlinear mapping from the (l-1)-th layer to the l-th layer, ωl is a weight matrix for the mapping, and bl is a bias vector for the l-th layer. For hidden layers, we use the rectified linear unit (ReLU) as the activation function because it overcomes gradient vanish. The activation function of the output layer is Sigmoid because it constrains the output value within the range of (0, 1). After training the DNN, we use it to estimate the probabilities that target companies will select candidate patents and thus generate recommendation lists for the target companies.

The DNN is a black-box model, which fails to explain why certain patents are recommended to a target company. To empower the DNN with interpretability, we combine it with the LRP technique, which infers the relevance of input features to the output value of the DNN. The core mechanism of LRP is that it assigns relevance scores to the input features by tracing their contributions, layer by layer, back to the output value. Consequently, we can interpret recommendation results by describing which pieces of information dominate in generating each recommendation. The relevance of each input feature is inferred as follows:

ril-1=jai,jl-1/kak,jl-1rjl, 11

where rjl is the relevance score of neuron j of layer l, ai,jl-1 is the contribution made by neuron i of layer (l-1) to neuron j of layer l in the forward propagation, and kak,jl-1 is the total contribution made by all neurons of layer (l-1) to neuron j of layer l in the forward propagation. After obtaining the relevance scores of the input features for each recommendation, we can visualize them for better interpretation.

graphic file with name 41598_2023_28766_Figa_HTML.jpg

Experimental evaluation

Experimental setup

Experiments are conducted on a computer with a Windows 11 operating system and an AMD Ryzen 5 PRO 4650U @ 2.10 GHz CPU. We implement codes using Python, TensorFlow, and other tools or packages like NumPy, Keras, Pandas, etc. The model optimizer, learning rate, and batch size are set to the Adam optimizer, 0.001, and 128, respectively. Besides, we set the dropout rate of turning off neurons to 0.2 to avoid overfitting.

We evaluate the proposed patent recommendation method with real-world data from the USPTO and PatentsView databases. The former contains information of patent transactions, namely, which patents have been assigned to which patent assignees. The latter includes details of patents, such as their titles, abstracts, categories, inventors, and references. The two share the same identities of patents and thus can be merged based on the identities. To evaluate the proposed method, we randomly select 500 companies that have at least 50 records of patent assignments and collect all patents owned by the companies from the USPTO database. Then, we collect patent information from the PatentsView database, namely, inventors, claims, categories, references, terms, and protected countries. The collected information constitutes a knowledge graph for the selected companies and its statistics are reported in Table 3.

Table 3.

Statistics of the knowledge graph.

Entity types Number of instances Relation types Number of instances
Companies 227,074 is_transferred_to 1,819,490
Patents 907,661 cites 14,467,861
Countries 123 is_protected_in 1,123,087
Inventors 816,154 is_invented_by 2,229,005
Claims 14,548,355 has_claim 14,548,355
Categories 177,017 has_category 8,631,105
Terms 1,420,766 has_term 2,497,322

For each selected company, we consider the first 50% of patents it purchased as known information in the knowledge graph and keep the rest unknown for model training or testing. This study randomly chooses 70% of the selected companies, along with their hypothetically unknown patents, as training data and the rest as test data.

Baseline methods

To evaluate the effectiveness of the proposed method (denoted as Ours), we compare it with the following baselines.

DNN+C a deep neural network with the connectivity features, which indicate the relevance between patents and companies. This method evaluates the role of the relevance in recommending suitable patents without considering their quality.

DNN+Q a deep neural network with the quality features, which reflect the quality of patents. The baseline evaluates the role of patent quality in recommending patents to companies without considering their relevance to the companies.

CF a collaborative filtering method that uses a user-item matrix to identify the nearest neighbors of a target user and leverages their preferences to infer the preference of the target user.

CB a content-based filtering method that represents patents and companies as vectors of terms and identifies relevant patents for target companies by calculating the cosine similarities of their vectors.

SVM a support vector machine model with the connectivity and quality features.

RF a random forest model with the connectivity and quality features.

HIN a patent recommendation method that leverages meta paths in knowledge graph for inferring the relevance between companies and patents8.

RippleNet a state-of-the-art recommendation method that leverages knowledge graph and deep learning for recommendations45.

Comparing the proposed method with the first two baselines reflects whether combing the connectivity and quality features outperforms using only one type of features. The remaining baselines are compared to evaluate whether the proposed method has superior performance in recommending patents to companies.

Evaluation metrics

This study selects three common metrics to evaluate the recommendation performance, namely, precision, recall, and MAP. Given a target company, precision shows the percentage of recommended patents that were transferred to the company. Recall represents the ratio of correctly recommended patents to the patents that were transferred to the company. Precision and recall fail to consider the rankings of correctly recommended patents in recommendation lists. MAP overcomes the drawback by considering both the precision and the rankings of recommended patents. A high MAP means that many suitable patents are recommended and they rank high on recommendation lists. We use RS to denote the set of patents recommended to the target company, TS the set of patents that were transferred to the target company, and RSTS the set of patents in both RS and TS. Then, we define the above metrics as follows:

precision@k=RSTSRS, 12
recall@k=RSTSTS, 13
MAP@k=1RS×i=1RSreli×precision@i, 14

where @k means the top k patents recommended to companies, RS and TS represent the number of patents in corresponding sets, and reli equals 1 if the i-th patent of RS is in TS and 0 otherwise.

Results and discussions

Recommendation results

To evaluate the effectiveness of the proposed method, we compare its recommendation performance with the baseline methods. We vary the number of recommended patents from 10 to 50 and present recommendation results in Table 4. The table uses bold numbers to represent the best performance of corresponding metrics and numbers in brackets to indicate the improvement of the proposed method compared to the best baseline method.

Table 4.

Recommendation performance of different methods.

Metrics Ours CF CB SVM RF HIN RippleNet
Precision@10 0.841 (7.50%) 0.159 0.403 0.778 0.782 0.738 0.767
Precision@20 0.667 (9.10%) 0.159 0.351 0.606 0.612 0.579 0.597
Precision@30 0.556 (7.65%) 0.151 0.314 0.509 0.517 0.489 0.501
Precision@40 0.484 (6.49%) 0.145 0.282 0.443 0.454 0.430 0.439
Precision@50 0.430 (5.67%) 0.141 0.257 0.393 0.407 0.384 0.390
Recall@10 0.468 (18.78%) 0.028 0.085 0.391 0.394 0.348 0.379
Recall@20 0.608 (20.15%) 0.047 0.129 0.498 0.506 0.443 0.477
Recall@30 0.667 (18.58%) 0.063 0.157 0.552 0.563 0.494 0.526
Recall@40 0.706 (17.63%) 0.074 0.174 0.584 0.600 0.528 0.556
Recall@50 0.730 (16.62%) 0.087 0.186 0.607 0.626 0.550 0.575
MAP@10 0.829 (8.56%) 0.139 0.396 0.766 0.764 0.726 0.759
MAP@20 0.655 (10.36%) 0.132 0.344 0.592 0.593 0.567 0.589
MAP@30 0.545 (9.09%) 0.123 0.308 0.495 0.499 0.478 0.494
MAP@40 0.473 (7.99%) 0.117 0.276 0.429 0.438 0.419 0.432
MAP@50 0.419 (6.98%) 0.112 0.251 0.380 0.391 0.374 0.384

Bold numbers represent the best performance of corresponding metrics and numbers in brackets indicate the improvement of the proposed method compared to the best baseline method.

Several conclusions can be drawn from the above results. First, the proposed method outperforms the baseline methods in terms of precision, recall, and MAP. This demonstrates the effectiveness and the necessity of considering patent quality in recommending patents to companies. Second, the MAP improvements are higher than the improvements in precision. This means that considering patent quality not only helps to find more patents that are relevant to the companies but also helps to improve the rankings of relevant patents in the recommendation lists. Third, the CF method performs poorly for the patent recommendation. This result meets our expectations because most patents have been transferred to only one company. Hence, the CF method suffers from the data sparsity problem in this recommendation context. Fourth, CF and CB methods perform much worse than the other methods. One likely reason is that only a small part of patent information is involved in the two methods. Fifth, our method outperforms SVM and RF even though they use the same information. This indicates that the deep learning model can better capture companies’ preferences than the traditional machine learning models in the current context.

Interpretability analysis

We conduct qualitative analysis to evaluate the interpretability of our method. Specifically, we design a module that visualizes the importance of features to each recommendation. Figure 3 illustrates the visualization of a randomly selected company and a patent recommended to it. The figure shows the main information based on which the patent is recommended to the company. Specifically, patent family size is the overriding reason, followed by P5 and P3, which can be referred to Table 2. The results show that being protected in multiple countries, having a direct citation relation, as well as having common inventors, with the patents owned by the company are the main reasons why the patent is recommended. In contrast, P6, the number of forward citations, and the number of backward citations barely impact the recommendation. Averagely, the connectivity features contribute more to the recommendation than the quality features.

Figure 3.

Figure 3

An illustration of the visualization module.

Ablation study

To analyze the impacts of key components of the proposed method in generating accurate and interpretable results, we conduct additional experiments. First, to examine the role of connectivity and quality features in patent recommendation, we experiment with the proposed method using different features and present results in Fig. 4. The results show that connectivity features generate good performance, meaning that they are helpful in finding relevant patents for companies. On the contrary, the recommendation performance of DNN+Q is about 0, meaning that quality features alone lead to very bad performance. This is because many high-quality patents are irrelevant to the business of target companies and are unnecessary for the companies. Consequently, recommending these patents to the target companies results in low performance. Besides, combining both types of features yields the best performance. This confirms the need of considering both types of features in patent recommendation.

Figure 4.

Figure 4

Recommendation performance of the proposed method with different features.

Second, we examine the impacts of important features on recommendation performance. This also helps to evaluate the interpretability of our method. To this end, we remove features identified as important from the proposed method and track its changes in recommendation performance. Specifically, we remove the top five most important features one by one and measure the resulting recommendation performance, which is presented in Fig. 5. Results show that the recommendation performance decreases as the number of removed features increases. However, the drops in performance may simply be caused by discarding features. To demonstrate whether removing the identified important features leads to larger drops compared to removing the unimportant features, we remove the least important features one by one and calculate the resulting recommendation performance.

Figure 5.

Figure 5

Recommendation performance after removing important features.

Figure 6 presents the recommendation results of removing the least important features one by one. The results show that removing the least important features also leads to drops in recommendation performance, but the drops are much smaller than those of removing the most important features. Besides, the performance difference between removing important and unimportant features widens as the number of removed features increases. These observations show that the identified important features do contribute to the recommendations and removing them leads to dramatic drops in precision, recall, and MAP. The results confirm that the identified important features are more significant to recommendation results and can be used to interpret the results.

Figure 6.

Figure 6

Recommendation performance after removing unimportant features.

Based on the above analysis and discussion, we conclude the impact of the proposed method on the research problems. Regarding the problem that previous patent recommendation methods ignore patent quality, the proposed method applies eight quality features to quantify patent quality and involves it in the recommendation process. Experimental results demonstrate that considering patent quality improves patent recommendation performance. In terms of lacking interpretability, the proposed method addresses it by combining a deep neural network with a relevance propagation technique, which can identify important features to each recommendation. Consequently, our method can generate explanations based on the important features and thus ease the lacking of interpretability.

Conclusions

In this study, we propose an interpretable patent recommendation method based on knowledge graph and deep learning. The proposed method organizes various patent information as a knowledge graph. It then defines and extracts connectivity and quality features that indicate the relevance of patent-company pairs and patent quality. Given the extracted features, we design an interpretable recommendation model by combining DNN and LRP techniques. We conduct experiments to evaluate the proposed method and experimental results show that its average precision@k, recall@k, and MAP@k (k = 10, 20, 30, 40, 50) are 0.596, 0.636, and 0.584, respectively. The results are 7.28%, 18.35%, and 8.60% higher than those of best baseline methods. We qualitatively and quantitatively analyze the interpretability of the proposed method. The qualitative and quantitative analyses show that our method can provide interpretable results.

This research extends existing studies on patent recommendations. First, this is the first study that combines knowledge graph, DNN, and LRP techniques for effective and interpretable patent recommendations. Knowledge graph models heterogeneous patent data and facilitates feature extraction and interpretation. DNN enables better patent recommendation by capturing nonlinear and nontrivial relationships between patents and companies. LRP endows our patent recommendation model with interpretability by identifying features that contribute the most to recommendation results. Experiments show that our method is superior to baseline methods. Second, this research incorporates patent quality into patent recommendation. Experiments demonstrate its effectiveness in enhancing recommendation performance. Third, this study provides a practical solution to facilitate technology transfer. Through the proposed patent recommendation method, companies can find suitable patents with explanations.

Future research can extend this study from different aspects. The first one is to incorporate more company information into the patent recommendation model. For example, the industries and business scopes of companies can be used to identify relevant patents. Future work can include such information into knowledge graph and extract more features for patent recommendations. The second one is to include patent portfolios into patent recommendations because companies may consider their existing patents when obtaining new ones. Future research can design new recommendation models with features that indicate the relationships between candidate patents and the existing patents.

Acknowledgements

We acknowledge the financial support from the Natural Science Foundation of Guangdong Province (No. 2022A1515011363), the Guangdong Philosophy and Social Science Planning Project (No.GD21CTS05), the Guangzhou Science and Technology Plan Project (No. 202002030384), and the 2021 Major Cultivation Project of Philosophy and Social Sciences in South China Normal University.

Author contributions

H.C. wrote the manuscript, W.D. proposed the method, conducted experiments, and analyzed results. All authors reviewed the manuscript.

Data availability

The data used in this paper are public and can be downloaded from the following links: https://www.uspto.gov/learning-and-resources/electronic-data-products/patent-assignment-dataset and http://www.patentsview.org/download/.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Lichtenthaler U. Technology exploitation in the context of open innovation: Finding the right ‘job’ for your technology. Technovation. 2010;30:429–435. doi: 10.1016/j.technovation.2010.04.001. [DOI] [Google Scholar]
  • 2.Lee J-S, Park J-H, Bae Z-T. The effects of licensing-in on innovative performance in different technological regimes. Res. Policy. 2017;46:485–496. doi: 10.1016/j.respol.2016.12.002. [DOI] [Google Scholar]
  • 3.Wang Y, Ning L, Chen J. Product diversification through licensing: Empirical evidence from Chinese firms. Eur. Manag. J. 2014;32:577–586. doi: 10.1016/j.emj.2013.09.001. [DOI] [Google Scholar]
  • 4.Han J-S, Lee S-YT. The impact of technology transfer contract on a firm’s market value in Korea. J. Technol. Transf. 2013;38:651–674. doi: 10.1007/s10961-012-9257-8. [DOI] [Google Scholar]
  • 5.Kwon S. How does patent transfer affect innovation of firms? Technol. Forecast. Soc. Change. 2020;154:119959. doi: 10.1016/j.techfore.2020.119959. [DOI] [Google Scholar]
  • 6.Roessner D, Bond J, Okubo S, Planting M. The economic impact of licensed commercialized inventions originating in university research. Res. Policy. 2013;42:23–34. doi: 10.1016/j.respol.2012.04.015. [DOI] [Google Scholar]
  • 7.Galasso A, Schankerman M, Serrano CJ. Trading and enforcing patent rights. RAND J. Econ. 2013;44:275–312. doi: 10.1111/1756-2171.12020. [DOI] [Google Scholar]
  • 8.Wang Q, Du W, Ma J, Liao X. Recommendation mechanism for patent trading empowered by heterogeneous information networks. Int. J. Electron. Commer. 2019;23:147–178. doi: 10.1080/10864415.2018.1564549. [DOI] [Google Scholar]
  • 9.Chen J, Chen J, Zhao S, Zhang Y, Tang J. Exploiting word embedding for heterogeneous topic model towards patent recommendation. Scientometrics. 2020;125:2091–2108. doi: 10.1007/s11192-020-03666-4. [DOI] [Google Scholar]
  • 10.He X, et al. Weighted meta paths and networking embedding for patent technology trade recommendations among subjects. Knowl. Based Syst. 2019;184:104899. doi: 10.1016/j.knosys.2019.104899. [DOI] [Google Scholar]
  • 11.Moeller A, Moehrle MG. Completing keyword patent search with semantic patent search: introducing a semiautomatic iterative method for patent near search based on semantic similarities. Scientometrics. 2015;102:77–96. doi: 10.1007/s11192-014-1446-9. [DOI] [Google Scholar]
  • 12.Trappey A, Trappey CV, Hsieh A. An intelligent patent recommender adopting machine learning approach for natural language processing: A case study for smart machinery technology mining. Technol. Forecast. Soc. Change. 2021;164:120511. doi: 10.1016/j.techfore.2020.120511. [DOI] [Google Scholar]
  • 13.Zhang, Y. et al. Semantic based heterogeneous information network embedding for patent citation recommendation. in 2020 International Conference on Artificial Intelligence and Computer Engineering (ICAICE) 518–527 (2020). 10.1109/ICAICE51518.2020.00106.
  • 14.Shu T, Tian X, Zhan X. Patent quality, firm value, and investor underreaction: Evidence from patent examiner busyness. J. Financ. Econ. 2022;143:1043–1069. doi: 10.1016/j.jfineco.2021.10.013. [DOI] [Google Scholar]
  • 15.Trappey AJC, Trappey CV, Wu C-Y, Lin C-W. A patent quality analysis for innovative technology and product development. Adv. Eng. Inform. 2012;26:26–34. doi: 10.1016/j.aei.2011.06.005. [DOI] [Google Scholar]
  • 16.Leiponen A, Delcamp H. The anatomy of a troll? Patent licensing business models in the light of patent reassignment data. Res. Policy. 2019;48:298–311. doi: 10.1016/j.respol.2018.08.019. [DOI] [Google Scholar]
  • 17.Liu, N. et al. Explainable recommender systems via resolving learning representations. in Proceedings of the 29th ACM International Conference on Information & Knowledge Management 895–904 (Association for Computing Machinery, 2020). 10.1145/3340531.3411919.
  • 18.Zheng, Y. et al. Disentangling User Interest and Conformity for Recommendation with Causal Embedding. arXiv:2006.11011 [cs] (2021).
  • 19.Oh, B., Seo, S. & Lee, K.-H. Knowledge graph completion by context-aware convolutional learning with multi-hop neighborhoods. in Proceedings of the 27th ACM International Conference on Information and Knowledge Management 257–266 (2018). 10.1145/3269206.3271769.
  • 20.Montavon G, Binder A, Lapuschkin S, Samek W, Müller K-R. Layer-wise relevance propagation: an overview. In: Samek W, Montavon G, Vedaldi A, Hansen LK, Müller K-R, editors. Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Springer International Publishing; 2019. pp. 193–209. [Google Scholar]
  • 21.Deldjoo Y, Schedl M, Cremonesi P, Pasi G. Recommender Systems Leveraging Multimedia Content. ACM Comput. Surv. 2020;53:106:1–106:38. [Google Scholar]
  • 22.Du W, Jiang G, Xu W, Ma J. Sequential patent trading recommendation using knowledge-aware attentional bidirectional long short-term memory network (KBiLSTM) J. Inf. Sci. 2021 doi: 10.1177/01655515211023937. [DOI] [Google Scholar]
  • 23.Fu, T., Lei, Z. & Lee, W. Patent citation recommendation for examiners. in 2015 IEEE International Conference on Data Mining 751–756 (2015). 10.1109/ICDM.2015.151.
  • 24.Deng W, Ma J. A knowledge graph approach for recommending patents to companies. Electron. Commer. Res. 2021 doi: 10.1007/s10660-021-09471-2. [DOI] [Google Scholar]
  • 25.Chen Y-L, Chiu Y-T. An IPC-based vector space model for patent retrieval. Inf. Process. Manag. 2011;47:309–322. doi: 10.1016/j.ipm.2010.06.001. [DOI] [Google Scholar]
  • 26.Ji, X., Gu, X., Dai, F., Chen, J. & Le, C. Patent collaborative filtering recommendation approach based on patent similarity. in 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) Vol. 3 1699–1703 (2011).
  • 27.Oh, S., Lei, Z., Lee, W.-C., Mitra, P. & Yen, J. CV-PCR: a context-guided value-driven framework for patent citation recommendation. in Proceedings of the 22nd ACM International Conference on Conference on Information & Knowledge Management 2291–2296 (ACM, 2013). 10.1145/2505515.2505659.
  • 28.Trappey AJC, Trappey CV, Wu C-Y, Fan CY, Lin Y-L. Intelligent patent recommendation system for innovative design collaboration. J. Netw. Comput. Appl. 2013;36:1441–1450. doi: 10.1016/j.jnca.2013.02.035. [DOI] [Google Scholar]
  • 29.Krestel, R. & Smyth, P. Recommending patents based on latent topics. in Proceedings of the 7th ACM Conference on Recommender Systems 395–398 (ACM, 2013). 10.1145/2507157.2507232.
  • 30.Oh, S., Lei, Z., Lee, W. & Yen, J. Recommending missing citations for newly granted patents. in 2014 International Conference on Data Science and Advanced Analytics (DSAA) 442–448 (2014). 10.1109/DSAA.2014.7058110.
  • 31.Mahdabi, P. & Crestani, F. Query-driven mining of citation networks for patent citation retrieval and recommendation. in Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management 1659–1668 (ACM, 2014). 10.1145/2661829.2661899.
  • 32.Deng N, Chen X, Li D. Intelligent recommendation of Chinese traditional medicine patents supporting new medicine’s R&D. J. Comput. Theor. Nanosci. 2016;13:5907–5913. doi: 10.1166/jctn.2016.5505. [DOI] [Google Scholar]
  • 33.Rui X, Min D. HIM-PRS: A patent recommendation system based on hierarchical index-based MapReduce framework. In: Park JJ, Jong H, Pan Y, Yi G, Loia V, editors. Advances in Computer Science and Ubiquitous Computing. Springer Singapore; 2017. pp. 843–848. [Google Scholar]
  • 34.Squicciarini M, Dernis H, Criscuolo C. Measuring patent. Quality. 2013 doi: 10.1787/5k4522wkw1r8-en. [DOI] [Google Scholar]
  • 35.Ciaramella L, Martínez C, Ménière Y. Tracking patent transfers in different European countries: Methods and a first application to medical technologies. Scientometrics. 2017;112:817–850. doi: 10.1007/s11192-017-2411-1. [DOI] [Google Scholar]
  • 36.Odasso C, Scellato G, Ughetto E. Selling patents at auction: An empirical analysis of patent value. Ind. Corp. Change. 2015;24:417–438. doi: 10.1093/icc/dtu015. [DOI] [Google Scholar]
  • 37.Reitzig M. Improving patent valuations for management purposes—Validating new indicators by analyzing application rationales. Res. Policy. 2004;33:939–957. doi: 10.1016/j.respol.2004.02.004. [DOI] [Google Scholar]
  • 38.Fischer T, Leidinger J. Testing patent value indicators on directly observed patent value—An empirical analysis of Ocean Tomo patent auctions. Res. Policy. 2014;43:519–529. doi: 10.1016/j.respol.2013.07.013. [DOI] [Google Scholar]
  • 39.Serrano CJ. The dynamics of the transfer and renewal of patents. RAND J. Econ. 2010;41:686–708. doi: 10.1111/j.1756-2171.2010.00117.x. [DOI] [Google Scholar]
  • 40.Sreekumaran Nair S, Mathew M, Nag D. Dynamics between patent latent variables and patent price. Technovation. 2011;31:648–654. doi: 10.1016/j.technovation.2011.07.002. [DOI] [Google Scholar]
  • 41.Mathew M, Mukhopadhyay C, Raju D, Nair SS, Nag D. An analysis of patent pricing. Decision; Kolkata. 2012;39:28–39. [Google Scholar]
  • 42.Manning, C. et al. The Stanford CoreNLP natural language processing toolkit. in Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations 55–60 (Association for Computational Linguistics, 2014). 10.3115/v1/P14-5010.
  • 43.Rose, S. J., Cowley, W. E., Crow, V. L. & Cramer, N. O. Rapid automatic keyword extraction for information retrieval and analysis. (2012).
  • 44.Liu H, Jiang Z, Song Y, Zhang T, Wu Z. User preference modeling based on meta paths and diversity regularization in heterogeneous information networks. Knowl. Based Syst. 2019;181:104784. doi: 10.1016/j.knosys.2019.05.027. [DOI] [Google Scholar]
  • 45.Wang, H. et al. RippleNet: Propagating user preferences on the knowledge graph for recommender systems. in Proceedings of the 27th ACM International Conference on Information and Knowledge Management 417–426 (ACM, 2018). 10.1145/3269206.3271739.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data used in this paper are public and can be downloaded from the following links: https://www.uspto.gov/learning-and-resources/electronic-data-products/patent-assignment-dataset and http://www.patentsview.org/download/.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES