
How to convert an HTML file to CSV: converting an HTML file to a CSV file with Python

Fri 21 Jul 13:14:15 BST 2017

TAT Signal and TMH near C-terminus

1 GCF_000688455.1_ASM68845v1_protein.faa.gz Acidobacterium ailaaui
Taxonomy Acidobacteria;Acidobacteriia;Acidobacteriales;Acidobacteriaceae;Acidobacterium
First 60 AAs MSRRTFVSSATAGLAALGALSSAAEGHAQLVWTSKNWKLAEFETLLREPARIRQVYDVTQ
WP_026442391.1 hypothetical protein [Acidobacterium ailaaui]
TMHMM WP_026442391.1 Length: 233
TMHMM WP_026442391.1 Number of predicted TMHs: 1
TMHMM WP_026442391.1 Exp number of AAs in TMHs: 21.25002
TMHMM WP_026442391.1 Exp number, first 60 AAs: 1.35114
TMHMM WP_026442391.1 Total prob of N-in: 0.67991
TMHMM WP_026442391.1 WP_026442391.1 inside 1 201
TMHMM WP_026442391.1 WP_026442391.1 TMhelix 202 224
TMHMM WP_026442391.1 WP_026442391.1 outside 225 233
2 GCF_000022565.1_ASM2256v1_protein.faa.gz Acidobacterium capsulatum ATCC 51196
Taxonomy Acidobacteria;Acidobacteriia;Acidobacteriales;Acidobacteriaceae;Acidobacterium;Acidobacterium capsulatum
First 60 AAs MKSISRRSFVTTAAAGMAALGSLGPALPAAQGQAVEMASDWDISSFNQLAQSPARVKQLF
WP_012680923.1 Tat pathway signal sequence domain-containing protein [Acidobacterium capsulatum]
TMHMM WP_012680923.1 Length: 237
TMHMM WP_012680923.1 Number of predicted TMHs: 1
TMHMM WP_012680923.1 Exp number of AAs in TMHs: 31.62059
TMHMM WP_012680923.1 Exp number, first 60 AAs: 5.92535
TMHMM WP_012680923.1 Total prob of N-in: 0.86701
TMHMM WP_012680923.1 WP_012680923.1 inside 1 205
TMHMM WP_012680923.1 WP_012680923.1 TMhelix 206 228
TMHMM WP_012680923.1 WP_012680923.1 outside 229 237
3 GCF_000014005.1_ASM1400v1_protein.faa.gz Candidatus Koribacter versatilis Ellin345
Taxonomy Acidobacteria;Acidobacteriia;Acidobacteriales;Acidobacteriaceae;Candidatus Koribacter;Candidatus Koribacter versatilis
First 60 AAs MGEKALMSKKPTIEEHLKATGVTRRSFVQLCGMLMAAAPIGLSLTSKASAQEVAKVVGKA
WP_011525036.1 hydrogenase 2 small subunit [Candidatus Koribacter versatilis]
TMHMM WP_011525036.1 Length: 401
TMHMM WP_011525036.1 Number of predicted TMHs: 1
TMHMM WP_011525036.1 Exp number of AAs in TMHs: 19.93057
TMHMM WP_011525036.1 Exp number, first 60 AAs: 2.05251
TMHMM WP_011525036.1 Total prob of N-in: 0.15168
TMHMM WP_011525036.1 WP_011525036.1 outside 1 344
TMHMM WP_011525036.1 WP_011525036.1 TMhelix 345 367
TMHMM WP_011525036.1 WP_011525036.1 inside 368 401
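
One possible way to turn a report like the one above into CSV is sketched below. It is only a sketch under several assumptions: the HTML tags have already been stripped so that the text arrives one field per line as shown, each record starts with a numbered `GCF_...` line, and one CSV row per protein is the desired layout. The column names, the `parse_records` and `to_csv` helpers, and the abridged `SAMPLE` text are all hypothetical names chosen for illustration; they are not part of the original report. The patterns allow optional whitespace because spacing in such exports is often inconsistent.

```python
import csv
import io
import re

# Abridged sample in the layout of the report above (assumed to be the
# text left over once the HTML markup has been removed).
SAMPLE = """\
1 GCF_000688455.1_ASM68845v1_protein.faa.gz Acidobacterium ailaaui
Taxonomy Acidobacteria;Acidobacteriia;Acidobacteriales;Acidobacteriaceae;Acidobacterium
First 60 AAs MSRRTFVSSATAGLAALGALSSAAEGHAQLVWTSKNWKLAEFETLLREPARIRQVYDVTQ
WP_026442391.1 hypothetical protein [Acidobacterium ailaaui]
TMHMM WP_026442391.1 Length: 233
TMHMM WP_026442391.1 Number of predicted TMHs: 1
TMHMM WP_026442391.1 WP_026442391.1 inside 1 201
TMHMM WP_026442391.1 WP_026442391.1 TMhelix 202 224
TMHMM WP_026442391.1 WP_026442391.1 outside 225 233
2 GCF_000022565.1_ASM2256v1_protein.faa.gz Acidobacterium capsulatum ATCC 51196
WP_012680923.1 Tat pathway signal sequence domain-containing protein [Acidobacterium capsulatum]
TMHMM WP_012680923.1 Length: 237
TMHMM WP_012680923.1 WP_012680923.1 TMhelix 206 228
"""

# Hypothetical column layout: one row per protein record.
FIELDS = ["index", "assembly_file", "organism", "taxonomy", "first_60_aas",
          "protein_id", "description", "length", "predicted_tmhs",
          "tmhelix_start", "tmhelix_end"]

HEADER_RE = re.compile(r"^(\d+)\s+(GCF_\S+)\s+(.+)$")
TAXONOMY_RE = re.compile(r"^Taxonomy\s+(.+)$")
FIRST60_RE = re.compile(r"^First\s*60\s*AAs\s+(\S+)$")
PROTEIN_RE = re.compile(r"^(WP_\S+)\s+(.+?)\s*\[(.+)\]$")
LENGTH_RE = re.compile(r"^TMHMM\s+\S+\s+Length:\s*(\d+)$")
NUM_TMH_RE = re.compile(
    r"^TMHMM\s+\S+\s+Number\s*of\s*predicted\s*TMHs:\s*(\d+)$")
HELIX_RE = re.compile(r"^TMHMM\s+\S+\s+\S+\s+TMhelix\s+(\d+)\s+(\d+)$")


def parse_records(text):
    """Group the report lines into one dict per numbered record."""
    records = []
    rec = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if m := HEADER_RE.match(line):
            # A numbered GCF_ line opens a new record.
            rec = {"index": int(m.group(1)),
                   "assembly_file": m.group(2),
                   "organism": m.group(3)}
            records.append(rec)
        elif rec is None:
            continue  # title/date lines before the first record
        elif m := TAXONOMY_RE.match(line):
            rec["taxonomy"] = m.group(1)
        elif m := FIRST60_RE.match(line):
            rec["first_60_aas"] = m.group(1)
        elif m := PROTEIN_RE.match(line):
            rec["protein_id"] = m.group(1)
            rec["description"] = m.group(2)
        elif m := LENGTH_RE.match(line):
            rec["length"] = int(m.group(1))
        elif m := NUM_TMH_RE.match(line):
            rec["predicted_tmhs"] = int(m.group(1))
        elif m := HELIX_RE.match(line):
            # Keep only the first helix; later ones are ignored here.
            rec.setdefault("tmhelix_start", int(m.group(1)))
            rec.setdefault("tmhelix_end", int(m.group(2)))
    return records


def to_csv(records):
    """Serialise the records to CSV text; missing fields stay empty."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, restval="",
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buf.getvalue()


rows = parse_records(SAMPLE)
print(to_csv(rows))
```

To write a file instead of a string, the same `csv.DictWriter` can be pointed at a handle opened with `open(path, "w", newline="")`. Lines the patterns do not recognise (for example the `Exp number` and `Total prob` summaries) are skipped in this sketch; further columns could be added in the same way if they are needed.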