text search with PDF files in SQL Server 2008

I have SQL Server 2008 R2 and am trying to implement full-text search on a PDF BLOB.

I have installed the iFilter from Adobe and confirmed it is installed

Using

EXEC sp_help_fulltext_system_components 'filter';

filter .pdf E8978DA6-047F-4E3D-9C78-CDBE46041603
C:Program FilesAdobeAdobe PDF iFilter 11 for 64-bit platformsbinPDFFilter.dll
11.0.1.36 Adobe Systems, Inc.

I then created a fulltext catalog for the FT Index and created the FT index

CREATE FULLTEXT INDEX ON Compliance_Updates
( 
FileDesc
 Language 1033,
 FileData
   TYPE COLUMN FileDataType
) 
 KEY INDEX PK_Compliance_Updates
     ON FT_Compliance_Updates; 

I then forced a rebuild of the index after adding some PDF's to the table. The index shows..

Catalogue Size : 0MB
Item Count : 2
Unique Key Count : 7
Name : FT_Compliance_Updates
Last Population Date : 12/11/2013 09:36
Population Status : Idle

However, when I perform the following search, I get zero results...

SELECT FileID, FileDesc, PubDate 
FROM Compliance_Updates 
WHERE CONTAINS(FileData, 'mortgage')

I've tried deleting the catalog, removing all the table records and indexes (including PK), re-running the iFilter install

exec sp_fulltext_service 'load_os_resources', 1;
exec sp_fulltext_service 'verify_signature', 0;

Restarting SQL Server, re-creating the indexes and FT catalog, nothing seems to work?


  • Version 11.x didn't work for me, but 9.x worked.
  • Also you need to add C:Program FilesAdobeAdobe PDF iFilter 9 for 64-bit platformsbin at the end of the System's PATH variable as well. Start > Control Panel > System > Advanced Environment Variables -> System Variables -> find PATH

  • Version 11.x didn't work for me too. 9.x works :) It is hard to find 9.x 64 Bit on the website of Adobe. But on FTP you could find it here: ftp://ftp.adobe.com/pub/adobe/acrobat/win/9.x/


    FWIW, even with SQL Server 2014, I was not able to get Version 11.x to work and so downloaded Version 9.x from the FTP link kindly provided above. Version 9.x still seems to be the way to go as it also worked for me! :^)

    链接地址: http://www.djcxy.com/p/55876.html

    上一篇: 在文件列表上全文搜索

    下一篇: 使用SQL Server 2008中的PDF文件进行文本搜索