SharePoint 2013 Search can OCR and index the content of scanned TIFF images during a crawl. This is a powerful feature, but configuring it can be confusing because the steps have changed since the 2010 version. I’ll outline them below:
- Using Server Manager, ensure the Windows TIFF IFilter feature is enabled on each crawl server.
- Open the Local Group Policy Editor and locate the OCR folder beneath Computer Configuration > Administrative Templates.
- Edit the policy setting for “Select OCR languages from a code page”. Choose Enabled and select the appropriate languages.
- Open the SharePoint Management Shell (using Run as Administrator) and run the following commands to configure content parsing for TIFF images.
    $ssa = Get-SPEnterpriseSearchServiceApplication
    New-SPEnterpriseSearchFileFormat -SearchApplication $ssa tif "TIFF Image File" "image/tiff"
    New-SPEnterpriseSearchFileFormat -SearchApplication $ssa tiff "TIFF Image File" "image/tiff"
- Restart the SharePoint Search Host Controller service.
- Open the Search Service Application administration page. Under Crawling in the navigation, select File Types and add two new file types: tif and tiff.
- Perform a Full Crawl of your content.
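
Several of the steps above that are done through the GUI can probably be scripted as well. The following is a minimal sketch, not a tested procedure. It assumes a single Search Service Application in the farm, that the Windows feature is named Windows-TIFF-IFilter, that the Search Host Controller's service name is SPSearchHostController, and that the tif and tiff crawl extensions do not already exist (the cmdlet reports an error if they do). The Group Policy OCR language setting is not covered and still needs to be set as described above.

```powershell
# Run in an elevated SharePoint Management Shell.

# Enable the Windows TIFF IFilter feature (repeat on each crawl server).
# Assumption: the feature name is Windows-TIFF-IFilter.
Import-Module ServerManager
Add-WindowsFeature Windows-TIFF-IFilter

# ... run the New-SPEnterpriseSearchFileFormat commands from the post here ...

# Restart the SharePoint Search Host Controller service.
# Assumption: the service name is SPSearchHostController.
Restart-Service SPSearchHostController

# Add tif and tiff as crawled file types.
# Assumption: only one Search Service Application exists in the farm.
$ssa = Get-SPEnterpriseSearchServiceApplication
foreach ($ext in 'tif', 'tiff') {
    New-SPEnterpriseSearchCrawlExtension -SearchApplication $ssa -Name $ext
}

# Start a full crawl of every content source.
Get-SPEnterpriseSearchCrawlContentSource -SearchApplication $ssa |
    ForEach-Object { $_.StartFullCrawl() }
```

Starting a full crawl on every content source can take a long time on a large farm, so you may prefer to filter the content sources down to the ones that actually hold scanned TIFF files.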