Products

Solutions

Resources

Partners

Community

Blog

About

QA

Ideas Test

New Community Website

Ordinarily, you'd be at the right spot, but we've recently launched a brand new community website... For the community, by the community.

Yay... Take Me to the Community!

Welcome to the DNN Community Forums, your preferred source of online community support for all things related to DNN.
In order to participate you must be a registered DNNizen

HomeHomeArchived Discus...Archived Discus...Developing Under Previous Versions of .NETDeveloping Under Previous Versions of .NETASP.Net 2.0ASP.Net 2.0Parse HTML out of a Word DocumentParse HTML out of a Word Document
Previous
 
Next
New Post
9/14/2007 1:46 AM
 

Does anybody have a way to get the contents of a word document to send it to a database?

 

I have tried a few Office Automation examples with less than stellar results and wanted some ideas on how to open a uploaded word document parse out its contents as HTML and save that HTMl to a database.

 

Any ideas anybody?


Dylan Barber http://www.braindice.com - Dotnetnuke development classes - skins and modules
 
New Post
9/14/2007 7:34 AM
 

If you can mandate Office 2007 (internal project), there's a pretty good API (http://msdn2.microsoft.com/en-us/library/bb739835.aspx#ManipulatingWord2007OpenXMLFiles_Overview). 

If you need to support earlier versions, you could purchase a control such as Apose.Words ($899).

If that's not an option either you're stuck with COM, and instanciating the app on the server.  This again would have to be an internal project, as it requires Microsoft Word to be installed on the server, and PIAs installed (http://www.microsoft.com/downloads/details.aspx?FamilyId=C41BD61E-3060-4F71-A6B4-01FEBA508E52&displaylang=en)

From there you can use the ApplicationClass.Documents.Open, and DocumentClass.SaveAs to save the file as html, and a filestream to read it to a string.

 
New Post
9/14/2007 12:06 PM
 

I looked at Apose.Word seems expensive - I could do it only for Office 2007 docs that might be my solution but the Officee Automation solution totally sucks


Dylan Barber http://www.braindice.com - Dotnetnuke development classes - skins and modules
 
Previous
 
Next
HomeHomeArchived Discus...Archived Discus...Developing Under Previous Versions of .NETDeveloping Under Previous Versions of .NETASP.Net 2.0ASP.Net 2.0Parse HTML out of a Word DocumentParse HTML out of a Word Document


These Forums are dedicated to discussion of DNN Platform and Evoq Solutions.

For the benefit of the community and to protect the integrity of the ecosystem, please observe the following posting guidelines:

  1. No Advertising. This includes promotion of commercial and non-commercial products or services which are not directly related to DNN.
  2. No vendor trolling / poaching. If someone posts about a vendor issue, allow the vendor or other customers to respond. Any post that looks like trolling / poaching will be removed.
  3. Discussion or promotion of DNN Platform product releases under a different brand name are strictly prohibited.
  4. No Flaming or Trolling.
  5. No Profanity, Racism, or Prejudice.
  6. Site Moderators have the final word on approving / removing a thread or post or comment.
  7. English language posting only, please.
What is Liquid Content?
Find Out
What is Liquid Content?
Find Out
What is Liquid Content?
Find Out