Viruses cause infections that may affect people's lives for periods ranging from several days to many years. Although preventive and therapeutic remedies are available for many viruses, these viruses may still have a profound impact on human life. Human immunodeficiency virus type 1 (HIV-1) is the most common cause of HIV infection, one of the most dangerous and complex diseases: it disrupts the immune system, leading to secondary complications and reducing health-related quality of life. While highly active antiretroviral therapy may decrease the viral load and slow the progression of HIV infection, individual characteristics may affect viral load control or the progression of HIV-induced T-cell dysfunction. Our study aims at the text-based identification of molecular mechanisms that may be involved in the progression of viral infection, using HIV as a case study. Specifically, we identified human proteins and genes that commonly occurred, overexpressed or underexpressed, in collections of publications relevant to (i) HIV infection progression and (ii) the acute and chronic stages of HIV infection. We then considered the biological processes controlled by the identified proteins and genes. We verified the impact of the identified molecules in the associated clinical study.
Keywords: HIV/AIDS; viral infection; viral infection progression; acute HIV infection; chronic HIV infection; text mining; machine learning