使用 bash 编辑元数据

Question 1

未经测试，作为起点，您可以适应：

for pic in *.jpeg *.bmp *.png; do
  serial="${pic%.*}"
  if test -r "${serial#photo-}"; then
    tags=`sed -n 's/.*<meta name="keywords" content="\([^"]*\)".*/\1/p' "${serial#photo-}"`
    # do what you want with "$pic" using "$tags"
  fi
done

因此，您迭代所有图片文件，测试是否可以读取删除了前缀和扩展名的文件，然后从元数据文件中删除标签。我不确定您打算使用什么工具来编辑图片的元数据。

Answer

未经测试，作为起点，您可以适应：

for pic in *.jpeg *.bmp *.png; do
  serial="${pic%.*}"
  if test -r "${serial#photo-}"; then
    tags=`sed -n 's/.*<meta name="keywords" content="\([^"]*\)".*/\1/p' "${serial#photo-}"`
    # do what you want with "$pic" using "$tags"
  fi
done

因此，您迭代所有图片文件，测试是否可以读取删除了前缀和扩展名的文件，然后从元数据文件中删除标签。我不确定您打算使用什么工具来编辑图片的元数据。

Question 2

在处理所有文件之前，最好使用您最喜欢的 GUI 工具设置所需的字段。然后分析该文件出口工具:

exiftool -XMP:all -IPTC:all test.jpg

它将打印字段的确切名称。之后您可以批量处理所有文件。例如，要设置XMP:description，发出：

exiftool -XMP:description="the" test.jpg

替代方案是ImageMagick 包中的identify和工具。convert

要从 html 中提取，我推荐使用 perl 包HTML树

test.html给定包含以下内容的文件：

<html><head>
<meta name="keywords" content="tag1, tag2, tag3, etc" />
</head>
<body></body>

运行此 perl 脚本来提取标签：

use HTML::TreeBuilder 5 -weak; # Ensure weak references in use
my $tree = HTML::TreeBuilder->new; # empty tree
$tree->parse_file("test.html");
my $meta = $tree->look_down(
  _tag => "meta",
  name => "keywords"
);
print $meta->attr("content");

Answer