Using semantic analysis of texts for the identification of drugs with similar therapeutic effects


Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

Semantic analysis of text collections was used to identify drugs with similar therapeutic activity. Natural language processing methods were applied to analyse > 2.5 mln texts from drug reviews (in English) found on patient forums and discussion boards. In order to obtain distributed word representations form the input data, a continuous bag-of-words type model was used. Such model is one of the word2vec models intended to analyse the natural language semantics. This allowed the assignment of a numeric vector to each drug name. A list of pairs of drugs with similar vectors was formed. An analysis of this list confirmed that similar word vectors correspond to either drugs with the same active compound or to drugs with close therapeutic effects that belong to the same therapeutic group. The chemical similarity in such drug pairs was found to be low. The suggested procedure was used to visualize the chemical drug space and in the search for compounds with potentially similar biological effects among drugs of different therapeutic groups.

About the authors

E. V. Tutubalina

Kazan Federal University

Author for correspondence.
Email: elvtutubalina@kpfu.ru
Russian Federation, 18 ul. Kremlyovskaya, Kazan, 420008

Z. Sh. Miftahutdinov

Kazan Federal University

Email: elvtutubalina@kpfu.ru
Russian Federation, 18 ul. Kremlyovskaya, Kazan, 420008

R. I. Nugmanov

Kazan Federal University

Email: elvtutubalina@kpfu.ru
Russian Federation, 18 ul. Kremlyovskaya, Kazan, 420008

T. I. Madzhidov

Kazan Federal University

Email: elvtutubalina@kpfu.ru
Russian Federation, 18 ul. Kremlyovskaya, Kazan, 420008

S. I. Nikolenko

Kazan Federal University; St. Petersburg Department of V. A. Steklov Institute of Mathematics, Russian Academy of Sciences

Email: elvtutubalina@kpfu.ru
Russian Federation, 18 ul. Kremlyovskaya, Kazan, 420008; 27 nab. Reki Fontanki, St. Petersburg, 191011

I. S. Alimova

Kazan Federal University

Email: elvtutubalina@kpfu.ru
Russian Federation, 18 ul. Kremlyovskaya, Kazan, 420008

A. E. Tropsha

Kazan Federal University; University of North Carolina at Chapel Hill

Email: elvtutubalina@kpfu.ru
Russian Federation, 18 ul. Kremlyovskaya, Kazan, 420008; 153A Country club Road, Jackson Hall, North Carolina, NC 27514

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2017 Springer Science+Business Media, LLC, part of Springer Nature