使用OpenMP并行化功能

2018-06-28 08:28:46

我试图并行运行代码，但我与私有/共享等与openmp相关的东西感到困惑。我使用的是c ++（msvc12或gcc）和openmp。

代码在循环中迭代，该循环由一个应该并行运行的块组成，后面跟着一个应该在完成所有并行运行时运行的块。并行处理的顺序并不重要。代码如下所示：

// some X, M, N, Y, Z are some constant values
const int processes = 4;
std::vector<double> vct(X);
std::vector<std::vector<double> > stackVct(processes, std::vector<double>(Y));
std::vector<std::vector<std::string> > files(processes, M)
for(int i=0; i < N; ++i)
{
  // parallel stuff
  for(int process = 0; process < processes; ++process)
  {
    std::vector<double> &otherVct = stackVct[process];
    const std::vector<std::string> &my_files = files[process];

    for(int file = 0; file < my_files.size(); ++file)
    { 
      // vct is read-only here, the value is not modified
      doSomeOtherStuff(otherVct, vct);

      // my_files[file] is read-only
      std::vector<double> thirdVct(Y);
      doSomeOtherStuff(my_files[file], thirdVct(Y));

      // thirdVct and vct are read-only
      doSomeOtherStuff2(thirdVct, otherVct, vct);
    }
  }
  // when all the parallel stuff is done, do this job
  // single thread stuff
  // stackVct is read-only, vct is modified
  doSingleTheadStuff(vct, stackVct)
}

如果性能更好，可以将“doSingleThreadSuff（...）”移动到并行循环中，但它需要由单个线程处理。最内层循环中的功能顺序不能改变。

我应该如何声明#pragma omp才能使其工作？谢谢！

并行运行for循环只是for循环语句之上的#pragma omp parallel for并且任何在for循环之外声明的变量都由所有线程共享，并且在for循环中声明的任何变量对于每个线程都是私有的。

请注意，如果您并行执行文件IO，则除非至少某些文件位于不同的物理硬盘驱动器上，否则可能看不到太多的加速（如果所有操作都是文件IO，则无法实现）。

也许像这样（请注意，这只是一个素描，我没有验证它，但你可以明白）：

// some X, M, N, Y, Z are some constant values
const int processes = 4;
std::vector<double> vct(X);
std::vector<std::vector<double> > stackVct(processes, std::vector<double>(Y));
std::vector<std::vector<std::string> > files(processes, M)
for(int i=0; i < N; ++i)
{
    // parallel stuff
    #pragma omp parallel firstprivate(vct, files) shared(stackVct)
    {
        #pragma omp for
        for(int process = 0; process < processes; ++process)
        {
            std::vector<double> &otherVct = stackVct[process];
            const std::vector<std::string> &my_files = files[process];

            for(int file = 0; file < my_files.size(); ++file)
            {
                // vct is read-only here, the value is not modified
                doSomeOtherStuff(otherVct, vct);

                // my_files[file] is read-only
                std::vector<double> thirdVct(Y);
                doSomeOtherStuff(my_files[file], thirdVct(Y));

                // thirdVct and vct are read-only
                doSomeOtherStuff2(thirdVct, otherVct, vct);
            }
        }
        // when all the parallel stuff is done, do this job
        // single thread stuff
        // stackVct is read-only, vct is modified
        #pragma omp single nowait
        doSingleTheadStuff(vct, stackVct)
    }
}

我将vct和files标记为第一个私有files ，因为它们是只读的，我认为它们不应该被修改，所以每个线程都会为自己获取这些变量的副本。

stackVct被标记为在所有线程之间共享，因为它们会对其进行修改。

最后，只有一个线程将执行doSingleTheadStuff函数，而不会强制其他线程等待。

链接地址: http://www.djcxy.com/p/79209.html

上一篇: Parallelize function using OpenMP

下一篇: Flash CS4 refuses to let go