Finding primers in a FASTA file with position, line, and column information

Question

I looked into this, and the simplest solution is the Biostar one, which is implemented in Perl.

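use strict;
use warnings;
use Bio::SeqIO;

my $usage = "perl dnamotif.pl <fasta file> <primers file>";
my $fasta_filename = shift(@ARGV) or die("Usage: $usage\n");
my $pfile = shift(@ARGV) or die("Usage: $usage\n");
my $line_width = 70;    # assumes the FASTA file wraps sequence lines at 70 characters
my $motifCounter = 1;

my $fasta_parser = Bio::SeqIO->new(-file => $fasta_filename, -format => 'Fasta');

while (my $seq_obj = $fasta_parser->next_seq()) {
    printf("Searching sequence '%s'...\n", $seq_obj->id);
    my $sequence = $seq_obj->seq();    # fetch once instead of on every index() call

    # Re-read the primers file for every sequence, one primer per line.
    open(my $info, '<', $pfile) or die "Could not open primers $pfile: $!";

    while (my $motif = <$info>) {
        chomp $motif;
        next unless length $motif;    # a blank line would match everywhere and loop forever
        printf("\n[%2s Looking motif [%s]]\n", $motifCounter, $motif);
        my $start = 0;

        # index() returns -1 once there are no further matches.
        while ((my $pos = index($sequence, $motif, $start)) != -1) {
            # Print the 1-based position, then [line, column] within the
            # sequence body (the FASTA header line is not counted).
            printf("\nmotif found at position %8d ", $pos + 1);
            printf("[%8d,%8d]", int($pos / $line_width) + 1, ($pos % $line_width) + 1);
            $start = $pos + 1;    # advance by one so overlapping matches are found
        }

        $motifCounter++;
        printf("\n");
    }
    close $info;
}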

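I found the basic code and modified it for my needs (I need to search for 80 primers). I am posting it here in case anyone else needs it. Part of the output from a real file is: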

Searching sequence 'ref|NW_024108998.1|:1-6077679'...

[ 1 Looking motif [CAATCGCCGT]]


[ 2 Looking motif [TTCCGAACCC]]

motif found at position   660830 [    9441,      30]
motif found at position   788016 [   11258,      26]
motif found at position  1349822 [   19284,      12]
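
The bracketed pair in each hit is derived from the 0-based match offset by integer division and remainder. The following is a minimal sketch of that arithmetic on its own, assuming the FASTA file wraps sequence lines at 70 characters as the script does; the helper name `offset_to_line_col` is hypothetical and not part of the original script or of BioPerl.

```perl
use strict;
use warnings;

# Convert a 0-based offset in the concatenated sequence into a 1-based
# (line, column) pair. $width is the assumed number of residues per line.
sub offset_to_line_col {
    my ($offset, $width) = @_;
    my $line = int($offset / $width) + 1;
    my $col  = ($offset % $width) + 1;
    return ($line, $col);
}

# The first hit in the sample output is at 1-based position 660830,
# which is 0-based offset 660829.
my ($line, $col) = offset_to_line_col(660830 - 1, 70);
printf("[%d,%d]\n", $line, $col);    # prints [9441,30]
```

The line number counts sequence lines only. For the first record in a file whose header occupies a single line, the corresponding line in the file itself would be one higher.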
