English Deutsch Français Italiano Español Português 繁體中文 Bahasa Indonesia Tiếng Việt ภาษาไทย
All categories

I am developing a web page with ASP.NET 2.0, where I come across a situation i.e I have an option for the user of the page to upload a document. I have to search certain keywords in the documents and also display the contents of the document in a text box in the page. For this I think I need to convert the document to a text for better search.

2006-12-10 19:27:44 · 3 answers · asked by gurushun 1 in Computers & Internet Programming & Design

3 answers

add a reference to the Microsoft Word type library.

Then use the code some thing similar to
Dim word As New Microsoft.Office.Interop.Word.Application
Dim document As Microsoft.Office.Interop.Word.Document
document = word.Documents.Open("c:\test.doc")



Once you Open the document in the winword do a selectall. Call copy text function write it to an out put file using file functions.

Do search you may get code

2006-12-11 05:24:50 · answer #1 · answered by raju 5 · 0 0

relies upon on how the PDF record replaced into created.... some classes create a PDF record with techniques from replacing the unique record right into a "snap shots" or image of the unique web page. there's no thanks to extract textual content from one in each and every of those PDF files. the actually Adobe Acrobat (author of the PDF format) can save files otherwise so as that the files must be extracted from them as textual content. yet you should have the $four hundred Acrobat software to extract the files. note itself does not have a fashion to extract techniques from a PDF.

2016-11-30 10:23:45 · answer #2 · answered by ? 4 · 0 0

word is an proprietry format...
try going through the code of open office maybe u get a clue

2006-12-10 21:16:26 · answer #3 · answered by Pegasus 3 · 0 0

fedest.com, questions and answers