How do I split one text file into multiple text files using Perl?

Answer 1

This works for the given format. It assumes the file always starts with 00:00:00:00.

#!/usr/bin/env perl

use strict;
use warnings;

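open(my $infh, '<', 'ABC_TabDelim.txt') or die "Cannot open ABC_TabDelim.txt: $!";

my $outfh;
my $filecount = 0;
while ( my $line = <$infh> ) {
    # A line starting with 00:00:00:00 begins the next output file.
    if ( $line =~ /^00:00:00:00/ ) {
        close($outfh) if $outfh;
        my $name = sprintf('ABC%02d_TabDelim.txt', ++$filecount);
        open($outfh, '>', $name) or die "Cannot open $name: $!";
    }
    # Guard against input that does not start with the marker.
    die "Line $. appears before the first 00:00:00:00 marker\n" unless $outfh;
    print {$outfh} $line or die "Failed to write to file: $!";
}

close($outfh) if $outfh;
close($infh);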

Answer 2

Here you go. There is no error checking; run it as, e.g., perl split file-to-munge

Update: script cleaned up following goldilocks's suggestions.

#!/usr/bin/perl

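$n = 1;
while (<>) {
    # A line starting with 00:00:00:00 begins the next output file.
    if (/^00:00:00:00/) {
        close($out) if $n != 1;
        $fn = sprintf("ABC%02d_TabDelim.txt", $n++);
        open($out, ">", $fn);
    }
    # Write the current line to the currently open output file.
    print {$out} $_;
}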

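Answer 3

If the expected output for the sample input is 4 files of 3 lines each, where the first line of each file starts with "00:00:00:00" and is followed by 2 more lines, then: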

perl -ne 'if(/^[0:]{11}/){close F if$f;open F,sprintf(">ABC%02d_TabDelim.txt",++$f)}print F' ABC_TabDelim.txt

Answer 4

You already have a Perl solution; here is one way to do it with awk:

awk '/00:00:00:00/ { out = sprintf("ABC%02d_TabDelim.txt", ++i) } { print > out }' ABC_TabDelim.txt

If you have to split into many files, you may want to close each file as you go. To do that, put if(out) close(out) in front of the sprintf call:

awk '/00:00:00:00/ { if(out) close(out); out = sprintf("ABC%02d_TabDelim.txt", ++i) } { print > out }' ABC_TabDelim.txt
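
One thing to keep in mind with the awk pattern: /00:00:00:00/ is unanchored, so it matches the timecode anywhere in a line, whereas the Perl versions use ^ to match only at the start of a line. If another column could ever contain 00:00:00:00, anchoring the awk pattern avoids spurious splits. A minimal sketch, where the sample rows and the out%02d.txt filenames are made up for illustration:

```shell
#!/bin/sh
set -eu

# Work in a scratch directory so no existing files are overwritten.
workdir=$(mktemp -d)
cd "$workdir"

# Hypothetical input: the second line has 00:00:00:00 in its second
# column only, so it should NOT start a new file.
printf '00:00:00:00\tfirst block\n'  >  sample.txt
printf '00:00:05:00\t00:00:00:00\n'  >> sample.txt
printf '00:00:00:00\tsecond block\n' >> sample.txt
printf '00:00:09:00\tlast line\n'    >> sample.txt

# Anchored pattern: only lines that begin with the timecode split.
awk '/^00:00:00:00/ { if (out) close(out); out = sprintf("out%02d.txt", ++i) }
     { print > out }' sample.txt

# Count the output files (two with the anchored pattern; the
# unanchored /00:00:00:00/ would produce three from this input).
ls out*.txt | wc -l
```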
