将 latex 源代码编译成 unicode 字符串

Question

潘多克对于不太复杂的文档来说，它表现不错。尝试

echo "\\\"o{A}c" | pandoc -f latex -t plain

或者，在 Python 中，

def latex_to_unicode(latex_string):
    '''Convert a LaTeX string to unicode.
    '''
    # Use pandoc for the job
    try:
        # This works in Python 3.4+
        return subprocess.check_output(
            ['pandoc', '-f', 'latex', '-t', 'plain'],
            input=latex_string
            )
    except TypeError:  # unexpected keyword 'input'
        p = subprocess.Popen(
            ['pandoc', '-f', 'latex', '-t', 'plain'],
            stdin=subprocess.PIPE,
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE
            )
        stdout, stderr = p.communicate(latex_string)
        return stdout.replace('\n', ' ').strip().decode('utf-8')

Answer 1

潘多克对于不太复杂的文档来说，它表现不错。尝试

echo "\\\"o{A}c" | pandoc -f latex -t plain

或者，在 Python 中，

def latex_to_unicode(latex_string):
    '''Convert a LaTeX string to unicode.
    '''
    # Use pandoc for the job
    try:
        # This works in Python 3.4+
        return subprocess.check_output(
            ['pandoc', '-f', 'latex', '-t', 'plain'],
            input=latex_string
            )
    except TypeError:  # unexpected keyword 'input'
        p = subprocess.Popen(
            ['pandoc', '-f', 'latex', '-t', 'plain'],
            stdin=subprocess.PIPE,
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE
            )
        stdout, stderr = p.communicate(latex_string)
        return stdout.replace('\n', ' ').strip().decode('utf-8')

将 latex 源代码编译成 unicode 字符串

答案1

相关内容