text search with PDF files in SQL Server 2008
I have SQL Server 2008 R2 and am trying to implement full-text search on a PDF BLOB.
I have installed the iFilter from Adobe and confirmed it is installed
Using
EXEC sp_help_fulltext_system_components 'filter';
filter .pdf E8978DA6-047F-4E3D-9C78-CDBE46041603
C:Program FilesAdobeAdobe PDF iFilter 11 for 64-bit platformsbinPDFFilter.dll
11.0.1.36 Adobe Systems, Inc.
I then created a fulltext catalog for the FT Index and created the FT index
CREATE FULLTEXT INDEX ON Compliance_Updates
(
FileDesc
Language 1033,
FileData
TYPE COLUMN FileDataType
)
KEY INDEX PK_Compliance_Updates
ON FT_Compliance_Updates;
I then forced a rebuild of the index after adding some PDF's to the table. The index shows..
Catalogue Size : 0MB
Item Count : 2
Unique Key Count : 7
Name : FT_Compliance_Updates
Last Population Date : 12/11/2013 09:36
Population Status : Idle
However, when I perform the following search, I get zero results...
SELECT FileID, FileDesc, PubDate
FROM Compliance_Updates
WHERE CONTAINS(FileData, 'mortgage')
I've tried deleting the catalog, removing all the table records and indexes (including PK), re-running the iFilter install
exec sp_fulltext_service 'load_os_resources', 1;
exec sp_fulltext_service 'verify_signature', 0;
Restarting SQL Server, re-creating the indexes and FT catalog, nothing seems to work?
C:Program FilesAdobeAdobe PDF iFilter 9 for 64-bit platformsbin
at the end of the System's PATH variable as well. Start > Control Panel > System > Advanced Environment Variables -> System Variables -> find PATH Version 11.x didn't work for me too. 9.x works :) It is hard to find 9.x 64 Bit on the website of Adobe. But on FTP you could find it here: ftp://ftp.adobe.com/pub/adobe/acrobat/win/9.x/
FWIW, even with SQL Server 2014, I was not able to get Version 11.x to work and so downloaded Version 9.x from the FTP link kindly provided above. Version 9.x still seems to be the way to go as it also worked for me! :^)
链接地址: http://www.djcxy.com/p/55876.html上一篇: 在文件列表上全文搜索