sql-server azure search azure-cognitive-search

Indexing text documents by Azure Search Service


Azure's documentation suggests using blobs to index documents such as MS Word and PDF files. We have an Azure SQL database with thousands of documents stored in a table's nvarchar(MAX) column. The content of each record is plain English text: the application converted the PDF and MS Word files into plain text and stored the result in the database.

My question: is it possible to index the "documents" stored in the database in the same way that Azure indexes blobs? I know how to create an Azure SQL indexer, but I'd like to make sure that the underlying search behaves the same for documents stored in a database table as it does for blobs.

Thanks in advance!


Solution

  • This is not currently possible - document extraction can only be done on blobs stored in Azure storage.
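
For reference, here is a minimal sketch of the Azure SQL indexer route mentioned in the question, in which the already-converted text column is indexed as an ordinary searchable string field with no document extraction step. It only builds the JSON request bodies for the Azure Cognitive Search REST API and does not send them. The table name `Documents`, the columns `Id` and `Content`, the resource names, and the API version are all assumptions for illustration, not values from the question.

```python
import json

# Assumption: an API version your search service supports.
API_VERSION = "2020-06-30"


def build_definitions(table="Documents", key_col="Id", text_col="Content"):
    """Return (datasource, index, indexer) request bodies as plain dicts.

    All names here are hypothetical placeholders. The bodies are meant to be
    sent to /datasources, /indexes and /indexers on your search service.
    """
    # Data source pointing at the Azure SQL table that holds the text.
    datasource = {
        "name": "docs-sql-ds",
        "type": "azuresql",
        "credentials": {"connectionString": "<your Azure SQL connection string>"},
        "container": {"name": table},
    }
    # Index with a key field and one full-text searchable field.
    # The key must be Edm.String, so a non-string SQL key column is
    # assumed to be mapped to a string by the indexer.
    index = {
        "name": "docs-index",
        "fields": [
            {"name": key_col, "type": "Edm.String", "key": True, "searchable": False},
            {"name": text_col, "type": "Edm.String", "searchable": True},
        ],
    }
    # Indexer that copies rows from the data source into the index.
    indexer = {
        "name": "docs-sql-indexer",
        "dataSourceName": datasource["name"],
        "targetIndexName": index["name"],
    }
    return datasource, index, indexer


if __name__ == "__main__":
    for body in build_definitions():
        print(json.dumps(body, indent=2))
```

Because the stored text is already plain, the searchable field is analyzed like any other string field; what this route does not provide is the extraction of text from PDF or Word files described in the answer above.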