advertisement
javaboutique
Search Tips
Articles  |   Tutorials  |   Reviews  |   Tools  |   by Category  |   by Date  |   by Name  |   Submit  |   Source  |   Forums  |  
javaboutique
Browse DevX


Partners & Affiliates











advertisement

Reviews : Davisor Offisor 1.5.1 :

Review: Davisor Offisor 1.5.1

by Drew Falkman

Summary

Sometimes it's the most mundane, seemingly basic tasks that end up taking a lot of time and effort to deal with. I've found this to be particularly true with content management - especially dealing with Microsoft Word documents and getting them to work on Web sites. Davisor Offisor is a Java tool library to help developers handle Word documents and get them into an easier format to work with: eXtensible Markup Language (XML). In this review, I will take a look at Offisor to see if this can help us with development. More Information

Introduction

As Internet technology has evolved, so too have the formats of documents -- we now have PDF, a pretty solid HTML standard and XML. In theory, as more end-users move towards universal document formats, this should make the prospect of content management easier. Unfortunately, in most circumstances all of these newer formats require special tools or technical understanding. And let's be realistic; most people still use Microsoft Word. Anyone who has worked with Word documents, and even the HTML/XML output of Word documents, knows that this is not an easy format to work with. Tools like Macromedia Dreamweaver MX even have special processes to, as Dreamweaver calls it, "Clean up Word HTML". Microsoft seems to be addressing this issue by adding significant XML support in Office 2003, but many users are still using Word 2002, 2000, 97 and earlier or don't have the understanding (or inclination to obtain it) necessary to work with XML. It is in this arena where Davisor Offisor can help.

How Offisor Works

One of the nice things about Offisor is that it doesn't require any proprietary plug-ins or libraries, such as you might expect when working with Microsoft formats. Offisor will work in any native Java application, on Windows, Linux or whatever. The only requirement is a SAX (1 or 2) compatible XML parser. In version 1.5.1, Offisor will handle two basic types of files; standard Word docs (versions 6, 95, 97 and 2000, and though undocumented I had luck with 2002) and "real-world" HTML files. The real- world HTML parser is a nice addition to the package, as it will parse looser and sloppier (as their Davisor calls it, "almost- but-not-quite compliant") HTML into XML, allowing developers to create a universal XML storage paradigm for any HTML and Word documents that are imported into an application.

Using Offisor is straightforward to say the least. There are two primary classes that are used to parse documents. com.davisor.ms.doc.DocParser and com.davisor.xml.html.HTMLParser. As you have probably surmised, these will process Word docs and HTML documents respectively. The examples included with Offisor are actually quite handy and provided a good look at how to use the API to transform documents. Additionally, the API is quite comprehensive and a number of core classes include utilities, interfaces and exceptions that you can use when coding with Offisor.

Setup, Installation and Documentation

Setting up Offisor on my computer was a simple task. The zip I downloaded included a WAR file which I deployed on my JRun 4 server. Everything worked on the first try! The download also includes the examples and a good bit of documentation. The documentation includes the Offisor user's manual, the API docs, a guide for obfuscating Offisor code (if a developer wanted to include this code in a larger software package), information about the output XML format, some sample transformation style documents and a version history. Frankly, this was more than I expected from a relatively simple tool(from an implementation standpoint at least).

How to Add Java Applets to Your Site

New on the Java Boutique:

New Review:

Time Management Made Easy with the Quartz Enterprise Job Scheduler
Why not just use the Java timer API? This open source scheduling API boasts simplicity, ease-of-integration, a well-rounded feature set, and it's free!

New Applet:

Reverse Complement
Reverse Complement is a simple applet that converts DNA or RNA sequences into three useful formats.

Elsewhere on internet.com:

WebDeveloper Java
Lots of Java information on webdeveloper.com

WDVL Java
Thorough Java resource at the Web Developer's Virtual Library.

ScriptSearch Java
Hundreds of free Java code files to download.

jGuru: Your View of the Java Universe
Customizable portal with online training, FAQs, regular news updates, and tutorials.

 DevX Skillbuilding from IBM developerWorks
 RIA Run Contest: Build Next-Gen Apps in Microsoft Silverlight 2
 Avaya DevConnect Center
 Intel Go Parallel Portal
 Internet.com eBook Library
 Microsoft RIA Development Center
 Destination .NET
XML error: not well-formed (invalid token) at line 53
advertisement
Receive Articles via our XML/RSS feed
Receive Articles via our XML/RSS feed

JavaBytes
Internet Cyclone
This powerful, easy-to-use, internet optimizer is for Windows 95, 98, ME, NT, 2000 and XP. It's designed to automatically optimize your Windows settings, boosting your Internet connection up to 200%.

SaaS Tool Offers Custom Database Development
Microsoft’s Automated Agent: Can We Talk?
Borland Finally Sells CodeGear
Red Hat Heads for the JON 2.0
Out with the Old, in with the New at JavaOne
Trolltech Expands WebKit Footprint
Oracle: Eating its Own Open Source Food
Big Money and Open Source May Not Compute
Open Source Embrace Gives Sun New Fans
NetBeans, OpenSolaris Also in Spotlight at JavaOne

Eliminate Fragmentation Frustration with Netbiscuits
Taming Trees: Building Branching Structures
Clean Up Function Syntax Mess with decltype
Sutter Speaks: The Future of Concurrency
INTEL SCAVENGER HUNT, LENOVO X300 AND APPLE IPOD TOUCH GIVEAWAY (the "Giveaway")
Comparing Multi-Core Processors for Server Virtualization
Intel® Desktop Business Computing Solutions
Intel: What Downturn?
Managing the Evolving Data Center
Implement Drag and Drop in Your Windows Forms Applications

Advertising Info  |   Member Services  |   Contact Us  |   Help  |   Feedback  |   Site Map  |   Network Map  |   About



JupiterOnlineMedia

internet.comearthweb.comDevx.commediabistro.comGraphics.com

Search:

Jupitermedia Corporation has two divisions: Jupiterimages and JupiterOnlineMedia

Jupitermedia Corporate Info


Legal Notices, Licensing, Reprints, & Permissions, Privacy Policy.

Advertise | Newsletters | Tech Jobs | Shopping | E-mail Offers

Solutions
Whitepapers and eBooks
Microsoft Article: HyperV-The Killer Feature in WinServer ‘08
Avaya Article: How to Feed Data into the Avaya Event Processor
Microsoft Article: Install What You Need with Win Server ‘08
HP eBook: Putting the Green into IT
Whitepaper: HP Integrated Citrix XenServer for HP ProLiant Servers
Intel Go Parallel Portal: Interview with C++ Guru Herb Sutter, Part 1
Intel Go Parallel Portal: Interview with C++ Guru Herb Sutter, Part 2--The Future of Concurrency
Avaya Article: Setting Up a SIP A/S Development Environment
IBM Article: How Cool Is Your Data Center?
Microsoft Article: Managing Virtual Machines with Microsoft System Center
HP eBook: Storage Networking , Part 1
Microsoft Article: Solving Data Center Complexity with Microsoft System Center Configuration Manager 2007
MORE WHITEPAPERS, EBOOKS, AND ARTICLES
Webcasts
Intel Video: Are Multi-core Processors Here to Stay?
On-Demand Webcast: Five Virtualization Trends to Watch
HP Video: Page Cost Calculator
Intel Video: APIs for Parallel Programming
HP Webcast: Storage Is Changing Fast - Be Ready or Be Left Behind
Microsoft Silverlight Video: Creating Fading Controls with Expression Design and Expression Blend 2
MORE WEBCASTS, PODCASTS, AND VIDEOS
Downloads and eKits
Sun Download: Solaris 8 Migration Assistant
Sybase Download: SQL Anywhere Developer Edition
Red Gate Download: SQL Backup Pro and free DBA Best Practices eBook
Red Gate Download: SQL Compare Pro 6
Iron Speed Designer Application Generator
MORE DOWNLOADS, EKITS, AND FREE TRIALS
Tutorials and Demos
How-to-Article: Preparing for Hyper-Threading Technology and Dual Core Technology
eTouch PDF: Conquering the Tyranny of E-Mail and Word Processors
IBM Article: Collaborating in the High-Performance Workplace
HP Demo: StorageWorks EVA4400
Intel Featured Algorhythm: Intel Threading Building Blocks--The Pipeline Class
Microsoft How-to Article: Get Going with Silverlight and Windows Live
MORE TUTORIALS, DEMOS AND STEP-BY-STEP GUIDES