Machine Learning/Kaggle HIV
Jump to navigation
Jump to search
http://kaggle.com/hivprogression
- How to Handle DNA Sequences
- Use sequence alignment to organize sequences across rows (software)
- There are two proteins we have sequences for - HIV Protease and HIV Reverse transcriptase. A great video that describes how these work is available here:
- HIV Protease helps cut up HIV proteins in to their right shapes once the cell starts producing them:
- PR Wiki: http://en.wikipedia.org/wiki/HIV-1_protease
- PR Sequence Info: http://www.bioafrica.net/proteomics/POL-PRprot.html
- PR Drug Resistance Info: http://hivdb.stanford.edu/cgi-bin/PositionPhenoSummary.cgi
- Reverse Transcriptase takes the viral RNA and converts it into DNA to be integrated into the cell:
- RT Wiki: http://en.wikipedia.org/wiki/Reverse_transcriptase (see HIV subsection)
- RT Sequence Info: http://bioafrica.mrc.ac.za/proteomics/POL-RTprot.html
- RT Drug Resistance Info: http://hivdb.stanford.edu/cgi-bin/PositionPhenoSummary.cgi