Item title:
Identifying overlapping phylogenetic and geographic roots of HIV-1 evolution through computational analyses
HIV-1 (Human Immunodeficiency Virus-1) is the main causative agent of Acquired Immunodeficiency Syndrome (AIDS). A human host infected with HIV-1 harbours many viral variants, but very little is known about how the pattern [17] of evolution differs among the phylogenetic lineages of non-recombinant HIV-1, normal inter-subtype recombinants, and the two main specific recombinant forms of HIV-1, i.e., Circulating Recombinant Forms (CRFs) and Unique Recombinant Forms (URFs). This study examines the differences between the evolutionary lineages of non-recombinant and recombinant HIV-1 genome sequences and identifies the geographic areas that have reported a high degree of HIV-1 occurrence and variety. A total of 1550 HIV-1 genome sequences were obtained from the HIV Los Alamos Database. The sequences were aligned using the MAFFT (Multiple Alignment using Fast Fourier Transform) web server. Alignment was carried out with 10 different sets of alignment parameter values. The aligned file was then used to construct a neighbour-joining (N-J) phylogenetic tree with Clustal X2. Phylogenetic analysis was performed with regard to the category to which each sequence belongs. The clade containing the probable ancestor remained constant across all sets of alignment parameter values. Non-recombinant isolates, inter-subtype recombinants, CRFs and URFs all followed different patterns of evolution. Non-recombinant sequences were found to be geographically specific and, to some extent, subtype specific, whereas normal recombinants were subtype specific and less geographically specific. CRFs varied in their pattern of evolution: in some instances they occurred as sister taxa of non-recombinant or normal inter-subtype recombinant sequences, and in others as sister taxa of other CRFs, where they were geographically specific. Three CRFs existed as completely diverged sequences.
There were four URFs: two were Indian isolates and the other two were Japanese isolates. URFs were found to be entirely geographically specific. Geographically, a high rate of variation was observed in India and Japan, as these two countries had sequences belonging to all of the above categories. Cameroon and South Africa have a very large number of isolates and considerable genetic variation among them, but they lack URFs.
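The check that a clade "remained constant" across trees built from different alignment parameter sets can be illustrated with a minimal sketch. It is not the authors' procedure, which used the MAFFT web server and Clustal X2. It assumes each tree has already been parsed into a nested tuple of leaf labels and that all trees are rooted in the same way; the labels (`anc`, `B1`, and so on) and the three toy trees are hypothetical.

```python
from typing import FrozenSet, List, Set, Tuple, Union

# A rooted tree is either a leaf label or a tuple of subtrees (assumed representation).
Tree = Union[str, Tuple["Tree", ...]]


def clades(tree: Tree) -> Set[FrozenSet[str]]:
    """Collect the leaf set under every internal node of a rooted tree."""
    found: Set[FrozenSet[str]] = set()

    def leaves(node: Tree) -> FrozenSet[str]:
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)
        return members

    leaves(tree)
    return found


def stable_clades(trees: List[Tree]) -> Set[FrozenSet[str]]:
    """Return the clades that appear in every tree of the list."""
    return set.intersection(*(clades(t) for t in trees))


# Hypothetical toy trees standing in for trees from different parameter sets.
tree_a: Tree = ((("anc", "B1"), "B2"), ("C1", "C2"))
tree_b: Tree = (("B2", ("B1", "anc")), ("C2", "C1"))
tree_c: Tree = ((("anc", "B1"), "C1"), ("B2", "C2"))

shared = stable_clades([tree_a, tree_b, tree_c])
print(frozenset({"anc", "B1"}) in shared)  # clade holding the putative ancestor
```

Comparing leaf sets rather than tree strings makes the check insensitive to the order in which branches are written. Because N-J trees are unrooted, a real comparison would first need a consistent rooting or a comparison of bipartitions instead.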