Screen shots showing how you might use ParseRat
(The images may be slow to load. This is necessary to preserve quality in the full-size images.)
|
|
Overview |
![]() Click images to enlarge. |
ParseRat works by:
|
| Finding input fields | |
|
If the input file is record-oriented, the user may choose to treat it as fixed format or as delimited. When ParseRat recognises common delimiters it will select them by default, but the user may over-ride with his/her choice of record format and may select any delimiter character. For example, if the input data is being pasted from the system clipboard after capture from another matrix-oriented program like Excel the data will be tab-delimited. ParseRat's initial assumption will recognise this and separate the input fields accordingly. |
|
If the input file is not record-oriented (e.g. it may be a redirected print file from another process or some other type of multi-line data blocks), ParseRat permits data to be extracted from the section or page headers, footers and single or multiple data blocks on the page. An example would be sheets of labels or of catalogue entries. The data may be in fixed locations on the page or it may be identified by its position in relation to specifed "tags" (useful for extracting data from emails). |
|
Emails, web pages and XML files are examples of tag-oriented data. In the case of XML files, ParseRat will make an automatic initial selection of tags which may be edited by the user if the ones chosen are not appropriate. In the case of other "tagged" input formats such as emails or HTML tables in pages saved from websites or captured by users on the system clipboard, the user identifies the tags of interest. |
|
Many files consist of binary data, with no delimiters. Frequently such files consist of an initial header block, usually giving details of the structure of the remainder of the file. Following the header (if any) are records of a fixed length but with no delimiters. These records may consist of text-only material or binary numbers or a combination of both. Even if you do not know the formal structure of the file format, in many cases you can determine the structure of a given file by inspecting it in ParseRat (you can usually see how "columns" line up, even if you do not have the file definition). You may then export its data in a manner similar to that for fixed format files. ParseRat is able to extract data from most structured binary files, correctly handling all binary data representations including zoned and packed decimal in both Intel and Motorola conventions (i.e. both "endianisms") ASCII and EBCDIC. |
| Parsing and processing the data from the fields | |
|
|
Optionally, input data blocks may be included in the output only if a user-selected field equals or contains a user-specified value. |
|
The data in the input fields may be combined in various sequences, further split into components or processed in other ways. If there is not a predefined process which suits the user's needs, a "plugin" program may be written in virtually any computer language to further process the data. In the example at the left, the predefined combination process assembles fields from the input data in three different ways. Additionally a plugin program called "reverse.exe" was written to take the value of "Combination 1", reverse it and return it as "Combination 4". This gives great flexibility when data needs to be generated from the contents of the input fields in ways determined only by the user. For example, it could be used to create passwords or registration codes by "hashing" values from selected input fields. |
|
Fields (or groups of fields combined) containing name data may be parsed in sophisticated manners. Titles (if any), first name, middle names, last names and suffices (if any) are parsed out into individual elements. Different (user specified) input formats may be re-assembled into other formats for output. e.g. a "Phone Book" pattern name may be converted to a "Mailing List" pattern format. Gender is automatically determined for 10,000 pre-programmed first names and users may add up to another 10,000 names. Gender-correct titles and salutations are generated suitable for use in letters, with automatic recognition of "couples" if male and female names are both present in the name. Words and word sequences to be omitted in the generation of name data may be specified (e.g. "Fax Line", "Teen Phone", etc). Multi word surnames are recognised with most prefix words pre-programmed (e.g. Van, De, Di, etc) and provision for the user to add more. |
|
Street address data is intelligently parsed into its essential components and may be re-assembled into a user specified standard address sequence to suit a particular postal authority. Standard abbreviations are generated for address elements accordng to rules which may be edited by the user. The generation of Odd/Even codes and the ability for ParseRat to divide a numeric field by 100 makes easy the generation of canvasser lists by hundred-block and side of street. Similar parsing is performed for City, State/Province and postal code. Postal code patterns for many countries are automatically recognised if present in a field. |
|
Date and time information is handled for all formats. Many formats are automatically recognised, including dBase dates, Julian Date (days since 4713 BC) and Calendar Date (Seconds since January 1st, 1970 - used in many computer applications, including Internet bookmark data). Dates may be exported in every format including numeric, full alphabetic, abbreviated alpha, US, UK and metric sequences. |
|
Measurement Unit Conversion is provided for fields selected or assembled using the Combo panel. Over 2,300 built-in conversions are provided or the user may specify any linear conversion (i.e. of the Y=A + BX pattern). In the example shown at left, the Combo1 field has been selected from an input field containing the number 63. The conversion "Hectares to Acres" has been selected and the result 155.676 is placed in the Convert1 parser generated field for placement in the output record. |
|
| Other parsing and processing options are available and are best evaluated by downloading the free thirty day evaluation version. This may be accomplished by clicking the button at the left. |
| Assembling and exporting data records in a variety of formats. | |
|
The user may choose from many choices of output format, for the file as a whole as well as for individual fields. The user may optionally insert header and trailer blocks ahead of and behind the database records (combining this with fixed text in some fields allows the automatic generation of web pages with tables of catalog items, links, etc).
To assist in mailing lists for "multiple up" forms, there is provision to "interleave" records in multiple sequences within the file. |
Rated
at TUCOWS
![]() |
ParseRat's fully functional 30 day trial version may be downloaded from many places including directly from Guy Software:
While the directly usable installer is very convenient, it is an EXE file and some systems block these or warn against their use. However you are downloading the file directly from the Guy Software site and therefore no other party has had the opportunity to tamper with it. Click here to download parserat.zip (approx 1 megabyte), a zipped archive copy requiring an unzipping tool (see below). | ||
| If you are located a long way from Chigaco (this site's physical location) you might get a faster download by connecting to one of the mirror sites of the SimTel file collection, located all around the world. To start your download of parserat.zip from the SimTel file collection location nearest you, click here to select your nearest location. | |||
| If you are unfamiliar with installing downloaded software you may click here to see a tutorial. | |||
| You may view the help file on line by clicking here. (If your browser does not know what to do with a HLP file, it might ask you which application to open it with. If so, please select WINHELP32.EXE in your Windows directory.) | |||
![]() |
Although the help file contains all of the information, you may download a ZIP file containing the manual in MS Word 2.0 format (which most word processors can read) by clicking here or a file in PDF format which needs the free Adobe Acrobat reader by clicking here. | ||
| Most computers already have a copy of the free Adobe Acrobat reader. If yours doesn't please click here. | |||
Neither the evaluation nor the paid-up version display third-party advertising, or send information from your system to any other place. |
Pricing for ParseRat
Prices in Pounds (£), Euros (€) and other currencies are conversions from US Dollars and will be recalculated based on currency rates at the time of order.
| Standard prices are for delivery by Electronic Download (no shipped media). |
Basic Price |
For shipped media as below |
Additional charge |
|
|
Single User |
$49.95 (USD) (£35.23, €38.87) |
Disk and printed manual, by airmail - USA/Canada |
$10.00 (£7.05, €7.78) |
|
|
Six User Package* |
$199.95 (£141.04, €155.60) |
Disk and printed manual, by airmail - other |
$20.00 (£14.11, €15.56) |
|
|
Twenty User Package |
$549.95 (£387.91, €427.97) |
Disk and printed manual by express courier |
$50.00 (£35.27, €38.91) | |
|
Forty User Package |
$949.95 (£670.05, €739.25) |
| ||
Non pre-paid purchase orders are accepted only from government and corporate entities and are subject to an additional handling charge of $10.00 USD (£7.05, €7.78) and a minimum order of $200.00 USD (£141.07, €155.64).
For the single copy licence, if PARSERAT is installed on the internal hard drive (or equivalent) of a computer and one person uses that computer more than 80% of the time it is in use, then that person may use PARSERAT on one other computer.
Any questions about the status of the shipment of the order, refunds, registration options, product details, technical support, volume discounts, dealer pricing, site licenses, non-credit card orders, etc, should be directed to Guy Software (click on the Email icon here). ![]()
Guy Software,
1752 Duchess Avenue
West Vancouver
British Columbia V7V 1P9
Canada
Tel: (604)926-1370
Fax: (604)926-1346
|
Note that Vancouver is in the Pacific Time Zone, 8 hours behind GMT. The current Vancouver time is 06:14:06 PM Wednesday, Mar 10 2010 |
Click here for Guy Software Home Page.