How to determine if an image needs to be rotated

I am trying to find a way to determine whether an image needs to be rotated in order for the text to be horizontally aligned. And if it does need to be rotated then by how many degrees?

I am sending the images to tesseract and for tesseract to be effective, the text in the images needs to be horizontally aligned.

I'm looking for a way to do this without depending on the "Orientation" metadata in the image.

I've thought of following ways to do this:

  • Rotate the image 90 degrees clockwise four times and send all four images to tesseract. This isn't ideal because of the need to process one image 4 times.
  • Use a Hough line transform to see whether the text lines are vertical or horizontal, and rotate the image if they are vertical. The image might still be upside down (off by 180 degrees) afterwards, so I'm unsure how effective this would be.
  • I'm wondering if there are other ways to accomplish this using OpenCV, ImageMagick, or any other image processing techniques.
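
The Hough idea from the second bullet might look roughly like the sketch below. The function names, the angle tolerance, and the Canny/Hough parameters are illustrative assumptions, not tuned values, and `hough_segments` assumes the `opencv-python` package is installed. The classifier itself only needs line segments as `(x1, y1, x2, y2)` tuples.

```python
import math


def dominant_orientation(segments, tolerance_deg=20.0):
    """Classify line segments as mostly 'horizontal' or 'vertical'.

    segments: iterable of (x1, y1, x2, y2) tuples, e.g. from a Hough transform.
    Each segment votes with its length, so long text baselines dominate.
    """
    horizontal = vertical = 0.0
    for x1, y1, x2, y2 in segments:
        length = math.hypot(x2 - x1, y2 - y1)
        # Fold the angle into [0, 90]: 0 is horizontal, 90 is vertical.
        angle = abs(math.degrees(math.atan2(y2 - y1, x2 - x1)))
        if angle > 90.0:
            angle = 180.0 - angle
        if angle <= tolerance_deg:
            horizontal += length
        elif angle >= 90.0 - tolerance_deg:
            vertical += length
    if horizontal == vertical:
        return "unknown"
    return "horizontal" if horizontal > vertical else "vertical"


def hough_segments(path):
    """Extract line segments with OpenCV (assumed dependency)."""
    import cv2  # imported lazily so the classifier above has no dependencies

    gray = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    edges = cv2.Canny(gray, 50, 150)  # thresholds are illustrative guesses
    lines = cv2.HoughLinesP(edges, 1, math.pi / 180, threshold=80,
                            minLineLength=50, maxLineGap=10)
    return [] if lines is None else [tuple(line[0]) for line in lines]
```

A result of "vertical" suggests a 90-degree rotation is needed, but as noted above this cannot tell an upright page from one that is upside down.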


    Attempting OCR on all 4 orientations seems like a reasonable choice, and I doubt you will find a more reliable heuristic.

    If speed is an issue, you could OCR a small part of the image first. Select a rectangular region whose amount of edge pixels and white/black ratio look like text, then send that region to tesseract in different orientations. With a small region, you could even try smaller steps than 90°, or combine it with another heuristic like Hough.
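
One possible way to pick such a region is sketched below. It assumes the image has already been binarized into rows of 0/1 values (1 = dark pixel); the function names, the window step, and the ink-ratio bounds are hypothetical and would need tuning on real scans. Edge pixels are approximated by counting horizontal transitions between ink and background.

```python
def window_stats(binary, top, left, height, width):
    """Return (ink_ratio, edge_density) for one window of a 0/1 image."""
    ink = edges = 0
    for r in range(top, top + height):
        row = binary[r]
        for c in range(left, left + width):
            ink += row[c]
            # Count horizontal ink/background transitions as edge pixels.
            if c + 1 < left + width and row[c] != row[c + 1]:
                edges += 1
    area = height * width
    return ink / area, edges / area


def pick_text_region(binary, height, width, step=None,
                     min_ink=0.05, max_ink=0.5):
    """Pick the window that looks most like text, or None if none qualifies.

    The ink-ratio bounds are illustrative guesses, not tuned values.
    Returns (top, left, height, width) of the qualifying window with the
    highest edge density.
    """
    step = step or max(1, min(height, width) // 2)
    best, best_edges = None, 0.0
    for top in range(0, len(binary) - height + 1, step):
        for left in range(0, len(binary[0]) - width + 1, step):
            ink, edges = window_stats(binary, top, left, height, width)
            if min_ink <= ink <= max_ink and edges > best_edges:
                best, best_edges = (top, left, height, width), edges
    return best
```

The returned rectangle can then be cropped from the original image and passed to tesseract in each candidate orientation instead of the full page.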

    If you remember the most likely orientation based on previous images, and stop once an orientation is successfully processed by tesseract, you probably do not even have to try most orientations in most cases.
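
A minimal sketch of that "remember and stop early" loop follows. The `score` callable is a hypothetical stand-in for whatever confidence you extract from a tesseract run, and `threshold` is an assumed cut-off for "successfully processed"; images are plain row-major lists here to keep the example dependency-free.

```python
def rotate_cw(image, quarter_turns):
    """Rotate a row-major 2D image clockwise by quarter_turns * 90 degrees."""
    for _ in range(quarter_turns % 4):
        image = [list(row) for row in zip(*image[::-1])]
    return image


class OrientationFinder:
    """Try 90-degree rotations, most frequently successful first."""

    def __init__(self, score, threshold):
        self.score = score          # hypothetical callable: image -> confidence
        self.threshold = threshold  # minimum confidence treated as success
        self.hits = [0, 0, 0, 0]    # how often each rotation has succeeded

    def find(self, image):
        """Return the clockwise quarter turns that fix the image, or None."""
        # Stable sort: rotations with more past successes are tried first.
        order = sorted(range(4), key=lambda k: -self.hits[k])
        for k in order:
            if self.score(rotate_cw(image, k)) >= self.threshold:
                self.hits[k] += 1
                return k  # stop early: remaining orientations are skipped
        return None
```

If most images in a batch share the same orientation, the first attempt usually succeeds and each image costs a single OCR call.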


    If you have around 1000 images labeled as horizontal or vertical, you can resize them to 224x224 and then fine-tune a convolutional neural network such as AlexNet or VGG for this task. If you want to know how many 90-degree rotations the image needs, you can instead set the labels to the number of clockwise rotations: 0, 1, 2, or 3.

    http://caffe.berkeleyvision.org/gathered/examples/finetune_flickr_style.html
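
Training data for the four-class variant can be generated from images that are already upright. The sketch below assumes the label means "number of clockwise 90-degree turns needed to restore the image", which is one reading of the labeling scheme above; `make_rotation_samples` is a hypothetical helper, and the resizing and network fine-tuning steps are not shown.

```python
def rotate_cw(image, quarter_turns):
    """Rotate a row-major 2D image clockwise by quarter_turns * 90 degrees."""
    for _ in range(quarter_turns % 4):
        image = [list(row) for row in zip(*image[::-1])]
    return image


def make_rotation_samples(upright_images):
    """Yield (rotated_image, label) pairs for a 4-class orientation classifier.

    label = number of clockwise 90-degree turns that restore the upright image
    (an assumed convention).
    """
    for image in upright_images:
        for label in range(4):
            # Turning 4 - label times clockwise leaves `label` turns to go.
            yield rotate_cw(image, (4 - label) % 4), label
```

Each upright image yields four labeled samples, so a modest set of correctly oriented scans is enough to build a balanced dataset.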
