Newest Project: Text “parsers” for MS Word, WordPerfect®, RTF, HTML, PDF, & Excel Documents


Well, I have taken on a major php-to-aps.net upgrade, this month.  The Project is a “resume" storage applications that takes common  documents and “parses” the text so it can be inserted into a SQL table as text.  The Parsers must strip out all “non printable characters”, white space, headers, footers, etc.

The php version, uses regular expressions and common string functions to manipulate the “garbage” out of the documents so all that is left is the written text. As I am not completely fluent in Regular Expressions, this is somewhat of a “troublesome” project.

More to follow………..

Advertisements
This entry was posted in Software. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s