blast 报错问题 awk 解决

BlastOutput.iterations.E.
Invalid value(s) [9] in VisibleString [AGO_S073922.1_geneid.geneid 2 exon (s) 261 – 281 6 aa, chain – incompleted …]
[blastall] ERROR: 20110903163100/t1.fna.blast.m7Output
BlastOutput.iterations.E.
Invalid value(s) [9] in VisibleString [AGO_S020563.1_geneid.geneid 2 exon (s) 1646 – 1669 6 aa, chain + incompleted …]
[blastall] ERROR: 20110903163100/t1.fna.blast.m7Output
BlastOutput.iterations.E.
Invalid value(s) [9] in VisibleString [AGO_S075634.1_geneid.geneid 2 exon (s) 252 – 287 10 aa, chain – incompleted …]
[blastall] ERROR: 20110903163100/t1.fna.blast.m7Output
BlastOutput.iterations.E.
Invalid value(s) [9] in VisibleString [AGO_S073717.1_geneid.geneid 2 exon (s) 408 – 437 8 aa, chain + incompleted …]
[blastall] ERROR: 20110903163100/t1.fna.blast.m7Output
BlastOutput.iterations.E.
Invalid value(s) [9] in VisibleString [AGO_S026839.1_geneid.geneid 2 exon (s) 412 – 465 16 aa, chain + incompleted …]

这是由于输入序列
>AGO_S001155.6_fgenesh.fgenesh 3 exon (s) 16076 – 17170 256 aa, chain –
MNAPEHTQKIVGAYLSDRKIQFTVGHEKKEFRVEAGVPQGSVIGPCLWNVMYNGLLKQKL
PEDVKIIAFADNVAVVATEWHKNFLEEAFGIVEEWMQKNGLTLAEHKTKVIVFTTRYTQK
NIKVRKHDAGQRRRKLLSCVMTSKLLYGAPCWAERMTTTIIRTVCSYRTVSHDAIAVVFT
DAFKDPVPNGTWDFWQVPLSILVKKYGQMPTLWIIPRYARACLFRVRRVVEAKARTSGIR
WRTNKGREYREIKDSE
>AGO_S057486.1_fgenesh.fgenesh 1 exon (s) 561 – 755 64 aa, chain +
MKMEVAADQAKLALSNVGDYGSTGLRTNKLIENRLKNVCMQIKNNDISILDFLMVVSHIR
KQCY
含有\t空格。标准fasta 格式要去掉
awk ‘{print $1}’ aphis_geneid_fgenesh_augustus.pep > aphis_geneid_fgenesh_augustus.pep.fasta 得到标准格式
shenzy@node5:/public/d_aphis/DNA_assembly/3.1fix/gene_predict/annotation/uniprot2$ head aphis_geneid_fgenesh_augustus.pep.fasta
>AGO_S001155.6_fgenesh.fgenesh
MNAPEHTQKIVGAYLSDRKIQFTVGHEKKEFRVEAGVPQGSVIGPCLWNVMYNGLLKQKL
PEDVKIIAFADNVAVVATEWHKNFLEEAFGIVEEWMQKNGLTLAEHKTKVIVFTTRYTQK
NIKVRKHDAGQRRRKLLSCVMTSKLLYGAPCWAERMTTTIIRTVCSYRTVSHDAIAVVFT
DAFKDPVPNGTWDFWQVPLSILVKKYGQMPTLWIIPRYARACLFRVRRVVEAKARTSGIR
WRTNKGREYREIKDSE
>AGO_S057486.1_fgenesh.fgenesh
MKMEVAADQAKLALSNVGDYGSTGLRTNKLIENRLKNVCMQIKNNDISILDFLMVVSHIR
KQCY

72 comments to blast 报错问题 awk 解决

Leave a Reply to film music licensing Cancel reply

  

  

  

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>