Objective To develop a heterogeneous graph prediction method based on the fusion of multi-layer semantics and topological information for addressing the challenges in drug-target interaction prediction, including insufficient modeling of high-order semantic dependencies, lack of adaptive fusion of semantic paths, and over-smoothing of node features. Methods A heterogeneous graph network with multiple types of entities such as drugs, proteins, side effects, and diseases was constructed, and graph embedding techniques were used to obtain low-dimensional feature representations. An adaptive metapath search module was introduced to automatically discover semantic path combinations for guiding the propagation of high-order semantic information. A semantic aggregation mechanism integrating multi-head attention was designed to automatically learn the importance of each semantic path based on contextual information and achieve differentiated aggregation and dynamic fusion among paths. A structure-aware gated graph convolutional module was then incorporated to regulate the feature propagation intensity for suppressing redundant information and redcuing over-smoothing. Finally, the potential interactions between drugs and targets were predicted through an inner product operation. Results Compared with existing drug-target interaction prediction methods, the proposed method achieved an average improvement of 3.4% and 2.4%, 3.0% and 3.8% in terms of the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (AUPRC) on public datasets, respectively. Conclusion The drug-target interaction prediction method developed in this study can effectively extract complex high-order semantic and topological information from heterogeneous biological networks, thereby improving the accuracy and stability of drug-target interaction prediction. This method provides technical support and theoretical foundation for precise drug target discovery and targeted treatment of complex diseases.
对于DTI预测,应用自适应元图来引导异质网络中的信息聚合,以获得药物和蛋白质的特征。对于构建自适应元图的方法,首先,自适应元图中节点的数目取决于信息在异质网络中传播的次数,假设节点的特征在异质网络中传播T次,则自适应元图中的节点为{S0,S1,…, ST }。在自适应元图中,节点对之间的可能连接方式依据其状态在信息传播过程中的相对位置进行设定。对于给定的2个节点Si和St(0≤i,t≤T,i∈N且t∈N),需判断Si是否为St的前一状态,以及St是否为最终状态。当 i=t-1且t<T时,由于第i次信息传播中的节点特征会以某种方式影响第t次传播,因此从Si到St的可能连接是集合EM中除L2外的所有类型。当 i<t-1且t<T时,此时状态Si可能不会影响状态St,因此可能得连接包括L2,当t=T时,即最后一层状态,那么从St-1到St的可能连接进一步限制为与药物或靶点相关的连接。
为验证模型在特征表达方面的有效性,本文以 Luo 数据集为例,基于训练后的药物与靶标嵌入向量,采用 t-SNE方法对高维特征进行降维,并进行二维可视化分析(图7),图中蓝色与红色点分别对应标签为0和1的节点,代表药物与靶标。药物与靶标节点在嵌入空间中形成2个主要聚簇,类别边界较为清晰,簇内呈现一定的聚集趋势。部分节点分布于簇边缘,表现为离群点。
Abbasi MesrabadiH, FaezK, PirgaziJ. Drug-target interaction prediction based on protein features, using wrapper feature selection[J]. Sci Rep, 2023, 13(1): 3594. doi:10.1038/s41598-023-30026-y
[2]
DehghanA, AbbasiK, RazzaghiP, et al. CCL-DTI: contributing the contrastive loss in drug-target interaction prediction[J]. BMC Bioinformatics, 2024, 25(1): 48. doi:10.1186/s12859-024-05671-3
KhatunMS, HasanMM, KurataH. PreAIP: computational prediction of anti-inflammatory peptides by integrating multiple complementary features[J]. Front Genet, 2019, 10: 129. doi:10.3389/fgene.2019.00129
[5]
RuXQ, WangLD, LiLH, et al. Exploration of the correlation between GPCRs and drugs based on a learning to rank algorithm[J]. Comput Biol Med, 2020, 119: 103660. doi:10.1016/j.compbiomed.2020.103660
[6]
DingYJ, TangJJ, GuoF. Identification of drug-target interactions via multi-view graph regularized link propagation model[J]. Neurocomputing, 2021, 461: 618-31. doi:10.1016/j.neucom.2021.05.100
[7]
SongW, XuL, HanC, et al. Drug-target interaction predictions with multi-view similarity network fusion strategy and deep interactive attention mechanism[J]. Bioinformatics, 2024, 40(6): btae346. doi:10.1093/bioinformatics/btae346
[8]
LuoY, ZhaoX, ZhouJ, et al. A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information[J]. Nat Commun, 2017, 8(1): 573. doi:10.1038/s41467-017-00680-8
[9]
ZhouD, XuZ, LiW, et al. MultiDTI: drug-target interaction prediction based on multi-modal representation learning to bridge the gap between new chemical entities and known heterogeneous network[J]. Bioinformatics, 2021, 37(23): 4485-92. doi:10.1093/bioinformatics/btab473
[10]
QiaoG, WangG, LiY. Causal enhanced drug-target interaction prediction based on graph generation and multi-source information fusion[J]. Bioinformatics, 2024, 40(10): btae570. doi:10.1093/bioinformatics/btae570
WanF, HongL, XiaoA, et al. NeoDTI: neural integration of neighbor information from a heterogeneous network for discovering new drug-target interactions[J]. Bioinformatics, 2019, 35(1): 104-11. doi:10.1093/bioinformatics/bty543
[16]
ZhaoT, HuY, ValsdottirLR, et al. Identifying drug-target interactions based on graph convolutional network and deep neural network[J]. Brief Bioinform, 2021, 22(2): 2141-50. doi:10.1093/bib/bbaa044
[17]
PengJ, WangY, GuanJ, et al. An end-to-end heterogeneous graph representation learning-based framework for drug-target interaction prediction[J]. Brief Bioinform, 2021, 22(5): bbaa430. doi:10.1093/bib/bbaa430
[18]
ZhuY, NingC, ZhangN, et al. GSRF-DTI: a framework for drug-target interaction prediction based on a drug-target pair network and representation learning on a large graph[J]. BMC Biol, 2024, 22(1): 156. doi:10.1186/s12915-024-01949-3
[19]
Mukesh M Raghuwanshi Pradnya S Borkar Sachin U Balvir. Node2Vec and machine learning: a powerful Duo for link prediction in social network[J]. Jes, 2024, 20(2s): 639-49. doi:10.52783/jes.1530
[20]
JohnsonSJ, MurtyMR, NavakanthI. A detailed review on word embedding techniques with emphasis on word2vec[J]. Multimed Tools Appl, 2024, 83(13): 37979-8007. doi:10.1007/s11042-023-17007-z
[21]
DingY, YaoQ, ZhaoH, et al. Diffmg: Differentiable meta graph search for heterogeneous graph neural networks[C].Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 2021: 279-88. doi:10.1145/3447548.3467447
[22]
GuoYB, ZhouDM, RuanXL, et al. Variational gated autoencoder-based feature extraction model for inferring disease-miRNA associations based on multiview features[J]. Neural Netw, 2023, 165: 491-505. doi:10.1016/j.neunet.2023.05.052
[23]
RasamoelinaAD, AdjailiaF, SincakP. A review of activation function for artificial neural network[C]//2020 IEEE 18th World Symposium on Applied Machine Intelligence and Informatics (SAMI). January 23-25, 2020. Herlany, Slovakia. IEEE, 2020: 281-6. doi:10.1109/sami48414.2020.9108717
[24]
DubeySR, SinghSK, ChaudhuriBB. Activation functions in deep learning: a comprehensive survey and benchmark[J]. Neuroco-mputing, 2022, 503: 92-108. doi:10.1016/j.neucom.2022.06.111
[25]
DeoraP, GhaderiR, TaheriH, et al.On the optimization and generalization of multi-head attention[J]. arXiv preprint arXiv:2310. 12680, 2023.
TianY, WangX, YaoX, et al. Predicting molecular properties based on the interpretable graph neural network with multistep focus mechanism[J]. Brief Bioinform, 2023, 24(1): bbac534. doi:10.1093/bib/bbac534
[28]
WangF, LiuH. Understanding the behaviour of contrastive loss[C].Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021: 2495-504. doi:10.1109/cvpr46437.2021.00252
[29]
RanjanP, KhanP, KumarS, et al. $łog$-sigmoid activation-based long short-term memory for time-series data classification[J]. IEEE Trans Artif Intell, 2024, 5(2): 672-83. doi:10.1109/tai.2023.3265641
[30]
ZhengY, PengH, ZhangXC, et al. Predicting drug targets from heterogeneous spaces using anchor graph hashing and ensemble learning[C]//2018 International Joint Conference on Neural Networks (IJCNN). July 8-13, 2018. Rio de Janeiro. IEEE, 2018: 1-7. doi:10.1109/ijcnn.2018.8489028
[31]
SrivastavaA, RawatBS, SinghG, et al. A review of optimization algorithms for training neural networks[C]//2023 International Conference on Sustainable Emerging Innovations in Engineering and Technology (ICSEIET). September 14-15, 2023. Ghaziabad, India. IEEE, 2023: 886-90. doi:10.1109/icseiet58677.2023.10303287
[32]
WangH, HuangF, XiongZ, et al. A heterogeneous network-based method with attentive meta-path extraction for predicting drug-target interactions[J]. Brief Bioinform, 2022, 23(4): bbac184. doi:10.1093/bib/bbac184
[33]
LiY, QiaoG, GaoX, et al. Supervised graph co-contrastive learning for drug-target interaction prediction[J]. Bioinformatics, 2022, 38(10): 2847-54. doi:10.1093/bioinformatics/btac164