比较不同偏移量的二进制文件，报告第一个差异的偏移量

Question

这是一个不错的起点这个很好的答案类似的问题。此函数将确定 fileA 是否包含在 fileB 中，并返回 fileA 在 fileB 中的偏移量。

我添加了返回停止匹配时的偏移量的功能。MinimumMatch如果文件的开头非常相似，您可能需要尝试以下设置：

Function Find-BytesUntilMismatch([byte[]]$Bytes, [byte[]]$Search, [int]$Start, [Switch]$All, [int]$MinimumMatch=200) {

    # Starting from offset $start, iterate through each byte
    For ($Index = $Start; $Index -le $Bytes.Length; $Index++) {

        # Check if byte matches, iterate through each following byte until 
        # bytes don't match or all bytes of $search are found:
        For ($i = 0; $i -lt $Search.Length -and $Bytes[$Index + $i] -eq $Search[$i]; $i++) {}

        # Search has exited, so check for complete file or return offset
        If ($i -lt $Search.Length -and $i -gt $MinimumMatch) { 
            Write-Output "file stopped matching at offset:$($index + $i); Total bytes matched:$($i)" 
            break 
        }
        If ($i -ge $Search.Length) { 
            Write-Output "full match completed at offset: $($Index + $i); Total bytes matched:$($i)" 

            # Check for additional matches if $All is set
            If (!$All) { Return } 
        } 
    }
    Write-Output "Search Complete"
}

如果您知道的开头FileA包含在FileB偏移量内10000，则运行以下命令：

# Import byte strings from files:
$FileA = [System.IO.File]::ReadAllBytes("C:\path\to\FileA.dat")
$FileB = [System.IO.File]::ReadAllBytes("C:\path\to\FileB.bin")

# Run function:
Find-BytesUntilMismatch -Search $FileA -Bytes $FileB -Start 91855 -MinimumMatch 100

输出：

file stopped matching at offset:1629233; Total bytes matched: 1537377
Search Complete

笔记：

偏移Start量从 0 开始，因此您可能需要向此命令提供偏移字节 -1。
数组是 32 位的，因此可以处理的最大文件大小可能是 2 GB

Answer 1

这是一个不错的起点这个很好的答案类似的问题。此函数将确定 fileA 是否包含在 fileB 中，并返回 fileA 在 fileB 中的偏移量。

我添加了返回停止匹配时的偏移量的功能。MinimumMatch如果文件的开头非常相似，您可能需要尝试以下设置：

Function Find-BytesUntilMismatch([byte[]]$Bytes, [byte[]]$Search, [int]$Start, [Switch]$All, [int]$MinimumMatch=200) {

    # Starting from offset $start, iterate through each byte
    For ($Index = $Start; $Index -le $Bytes.Length; $Index++) {

        # Check if byte matches, iterate through each following byte until 
        # bytes don't match or all bytes of $search are found:
        For ($i = 0; $i -lt $Search.Length -and $Bytes[$Index + $i] -eq $Search[$i]; $i++) {}

        # Search has exited, so check for complete file or return offset
        If ($i -lt $Search.Length -and $i -gt $MinimumMatch) { 
            Write-Output "file stopped matching at offset:$($index + $i); Total bytes matched:$($i)" 
            break 
        }
        If ($i -ge $Search.Length) { 
            Write-Output "full match completed at offset: $($Index + $i); Total bytes matched:$($i)" 

            # Check for additional matches if $All is set
            If (!$All) { Return } 
        } 
    }
    Write-Output "Search Complete"
}

如果您知道的开头FileA包含在FileB偏移量内10000，则运行以下命令：

# Import byte strings from files:
$FileA = [System.IO.File]::ReadAllBytes("C:\path\to\FileA.dat")
$FileB = [System.IO.File]::ReadAllBytes("C:\path\to\FileB.bin")

# Run function:
Find-BytesUntilMismatch -Search $FileA -Bytes $FileB -Start 91855 -MinimumMatch 100

输出：

file stopped matching at offset:1629233; Total bytes matched: 1537377
Search Complete

笔记：

偏移Start量从 0 开始，因此您可能需要向此命令提供偏移字节 -1。
数组是 32 位的，因此可以处理的最大文件大小可能是 2 GB

比较不同偏移量的二进制文件，报告第一个差异的偏移量

答案1

相关内容