https://medium.com/analytics-vidhya/extracting-text-from-scanned-pdf-using-pytesseract-open-cv-cd670ee38052