How do I perform a conditional search and replace in a text file?

Answer

I would use Perl for this. The Text::BibTeX module (http://search.cpan.org/~ambs/Text-BibTeX-0.70/lib/Text/BibTeX.pm) should help. For example:

use strict;
use warnings;
use Text::BibTeX;

my $bibfile = Text::BibTeX::File->new("foo.bib")     or die "foo.bib: $!\n";
my $newfile = Text::BibTeX::File->new(">newfoo.bib") or die "newfoo.bib: $!\n";

while (my $entry = Text::BibTeX::Entry->new($bibfile)) {
    # Entries that fail to parse are skipped and not written to the output.
    next unless $entry->parse_ok;

    my $has_year = $entry->exists('year');
    my $has_date = $entry->exists('date');

    # Only 'year' present: copy it into 'date'.
    if ($has_year and not $has_date) {
        $entry->set('date', $entry->get('year'));
    }
    # Only 'date' present: use its first four characters as 'year'.
    if ($has_date and not $has_year) {
        $entry->set('year', substr($entry->get('date'), 0, 4));
    }
    $entry->write($newfile);
}
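
As a rough illustration of the rule this loop applies to each entry, the sketch below reimplements only the decision logic in POSIX shell, without Text::BibTeX. The function name fill_year_date and its output format are made up for this example, and it works on plain strings rather than real BibTeX entries.

```shell
#!/bin/sh
# fill_year_date YEAR DATE  (hypothetical helper, not part of Text::BibTeX)
# Prints both fields after filling in whichever one is missing.
fill_year_date() {
    year=$1
    date=$2
    if [ -n "$year" ] && [ -z "$date" ]; then
        # Only year present: copy it into date.
        date=$year
    elif [ -n "$date" ] && [ -z "$year" ]; then
        # Only date present: its first four characters become the year.
        year=$(printf '%s' "$date" | cut -c1-4)
    fi
    printf 'year=%s date=%s\n' "$year" "$date"
}

fill_year_date 2010 ""           # prints: year=2010 date=2010
fill_year_date "" 2012-05-17     # prints: year=2012 date=2012-05-17
fill_year_date 2011 2012-05-17   # prints: year=2011 date=2012-05-17
```

When both fields are present, neither branch fires and the entry is left unchanged, which matches the Perl loop.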

Answer

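Note: this solution was written for the original set of requirements and would need updating to work with the current version of the question. In any case, the Perl-based answer is more concise :-)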

If you don't mind creating some temporary files, this can serve as a starting point. Copy it into a file and set the executable flag (chmod +x file):

#!/bin/bash
INFILE=${1:?usage: $0 file.bib}

# split the file first: one temporary file per record (records start with "@").
# Zero-padded names keep the glob below in the original record order;
# lines before the first "@" are ignored.
awk '/^@/{if (x) close(x); x=sprintf("tmp__%06d", ++i)} x{print > x}' "$INFILE"

# process individual files (sed -i requires GNU sed)
for file in tmp__* ; do
    DATE=$(grep "^[[:space:]]*Date" "$file" | sed "s/.*{\(.*\)}.*/\1/g")
    YEAR=$(grep "^[[:space:]]*Year" "$file" | sed "s/.*{\(.*\)}.*/\1/g")

    # Both year and date. Substitute year with date
    if [[ -n "$DATE" && -n "$YEAR" ]] ; then
        sed -i "s/\(^[[:space:]]*Year.*\)${YEAR}\(.*\)/\1${DATE}\2/g" "$file"
    fi

    # Only year: rename the Year field to Date
    if [[ -z "$DATE" && -n "$YEAR" ]] ; then
        sed -i "s/\(^[[:space:]]*\)Year/\1Date/g" "$file"
    fi
done

# concatenate the files back
cat tmp__* > out.bib
rm -f tmp__*

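The script does the following:

Takes one argument, the input file name.
Splits the file into several temporary files, each containing a single record.
Iterates over those files and processes each one according to your instructions (assuming I understood them correctly; see below).
Concatenates the processed files into out.bib.
Deletes the temporary files.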
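
The splitting step above can be tried on its own. The sketch below runs a variant of the awk command on a two-record sample in a scratch directory; the sample contents, the scratch directory, and the zero-padded tmp__NNNNNN names are assumptions made for this illustration.

```shell
#!/bin/sh
# Work in a throwaway directory so nothing else is touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical two-record input.
cat > sample.bib <<'EOF'
@article{a,
  Year = {2010},
}
@book{b,
  Date = {2012-05-17},
}
EOF

# Start a new output file at every line beginning with "@";
# every later line goes into the file of the current record.
awk '/^@/{if (x) close(x); x=sprintf("tmp__%06d", ++i)} x{print > x}' sample.bib

# Count the pieces that were produced.
set -- tmp__*
count=$#
echo "$count record files"
```

Each tmp__ file then holds exactly one record, so the later grep and sed calls never see fields from two entries at once.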

The script does not modify the original input file, so it should be quite safe.
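
The grep and sed pipeline the script uses to read a field value can also be checked in isolation. This is a minimal sketch; the sample line is hypothetical, and it assumes one field per line in the form Name = {value}, with a single pair of braces.

```shell
#!/bin/sh
# Hypothetical field line as it might appear inside a record.
line='  Date = {2012-05-17},'

# Keep only lines that start with the field name, then strip everything
# except the text between the braces.
value=$(printf '%s\n' "$line" | grep "^[[:space:]]*Date" | sed "s/.*{\(.*\)}.*/\1/g")
echo "$value"
```

Fields spread over several lines, or values that themselves contain braces, fall outside this assumption and are better handled by a real parser such as Text::BibTeX.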

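I am still not entirely clear on your requirements, so if you try it and find cases where it does not do what you expect, feel free to let me know and I will try to improve it.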

Edit: in hindsight, everything is clear.

Below is the earlier situation, from before I had thought about how JabRef works internally.
