Machine Learning/Kaggle HIV: Difference between revisions

From Noisebridge
*How to Handle DNA Sequences
**Use [http://en.wikipedia.org/wiki/Sequence_alignment sequence alignment] to organize sequences across rows ([http://en.wikipedia.org/wiki/List_of_sequence_alignment_software software])
*We have sequences for two proteins: HIV protease (PR) and HIV reverse transcriptase (RT). This video describes how they work:
**http://www.youtube.com/watch?v=RO8MP3wMvqg&feature=related
*HIV protease cuts newly made HIV proteins into their correct, functional forms once the cell starts producing them:
**PR Wiki: http://en.wikipedia.org/wiki/HIV-1_protease
**PR Sequence Info: http://www.bioafrica.net/proteomics/POL-PRprot.html
**PR Drug Resistance Info: http://hivdb.stanford.edu/cgi-bin/PositionPhenoSummary.cgi
*Reverse transcriptase converts the viral RNA into DNA, which is then integrated into the host cell's genome:
**RT Wiki: http://en.wikipedia.org/wiki/Reverse_transcriptase (see HIV subsection)
**RT Sequence Info: http://bioafrica.mrc.ac.za/proteomics/POL-RTprot.html
**RT Drug Resistance Info: http://hivdb.stanford.edu/cgi-bin/PositionPhenoSummary.cgi
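
The sequence alignment idea in the list above can be tried out with a small pairwise aligner. This is only a minimal sketch of global alignment (Needleman-Wunsch) with a simple linear gap penalty. The function name, the scoring values, and the toy sequences are assumptions for illustration and are not taken from the Kaggle data. For the real data set, one of the alignment packages linked above is likely a better choice, especially for aligning many sequences at once.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align strings a and b; return (aligned_a, aligned_b, score).

    Scoring values are illustrative defaults, not tuned for HIV sequences.
    """
    n, m = len(a), len(b)

    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def sub(i, j):
        # Substitution score for pairing a[i-1] with b[j-1]
        return match if a[i - 1] == b[j - 1] else mismatch

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # pair the two characters
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one best alignment
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]


if __name__ == "__main__":
    # Toy sequences, made up for this example
    row_a, row_b, total = needleman_wunsch("ACGT", "AGT")
    print(row_a)
    print(row_b)
    print(total)
```

After alignment, both strings have the same length, so each column refers to the same position in every row. That is what makes position-based features (for example, "which residue is at column 30?") comparable across patients.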

Revision as of 11:02, 30 June 2010

http://kaggle.com/hivprogression