将队列中的作业调度到多个线程上

Question

听起来你想要一个工作队列。您可以使用需要处理的文件集合填充该队列，并使用一个函数将项目从队列中出列，该函数执行必要的锁定以防止线程之间的竞争。然后启动您想要的任意线程。每个线程将从队列中取出一个项目，对其进行处理，然后将下一个项目取出。当队列变空时，线程可以阻塞等待更多输入，或者如果您知道不会有更多输入，则线程可以终止。

这是一个简单的例子：

#include <cstdio>
#include <mutex>
#include <queue>
#include <thread>

template<typename T>
class ThreadSafeQueue {
public:
    void enqueue(const T& element)
    {
        std::lock_guard<std::mutex> lock(m_mutex);

        m_queue.push(element);
    }

    bool dequeue(T& value)
    {
        std::lock_guard<std::mutex> lock(m_mutex);

        if (m_queue.empty()) {
            return false;
        }

        value = m_queue.front();
        m_queue.pop();

        return true;
    }

private:
    std::mutex m_mutex;
    std::queue<T> m_queue;
};

static void threadEntry(const int threadNumber, ThreadSafeQueue<std::string>* const queue)
{
    std::string filename;

    while (queue->dequeue(filename)) {
        printf("Thread %d processing file '%s'\n", threadNumber, filename.c_str());
    }
}

int main()
{
    ThreadSafeQueue<std::string> queue;

    // Populate queue
    for (int i = 0; i < 100000; ++i) {
        queue.enqueue("filename_" + std::to_string(i) + ".txt");
    }

    const size_t NUM_THREADS = 4;

    // Spin up some threads
    std::thread threads[NUM_THREADS];
    for (int i = 0; i < NUM_THREADS; ++i) {
        threads[i] = std::thread(threadEntry, i, &queue);
    }

    // Wait for threads to finish
    for (int i = 0; i < NUM_THREADS; ++i) {
        threads[i].join();
    }

    return 0;
}

编译：

$ g++ example.cpp -pthread

该程序定义了ThreadSafeQueue一个具有内部锁定的队列，以允许多个线程同时访问它。

该main函数首先填充队列。然后它启动 4 个线程。每个线程从队列中读取一个值并“处理”它（这里是通过将消息打印到标准输出）。当队列为空时，线程终止。该main函数在返回之前等待线程终止。

请注意，此设计假设所有元素在线程启动之前都已填充到队列中。经过一些更改，它可以扩展为支持在线程运行时处理新工作。

Answer 1