Ricebridge
Search This Site
Feb 04 2012 16:52 UTC


$15 Gift Certificate for every bug you find.

MaxMind GeoIP
This IP address to country database is provided as a CSV file that can easily be read by CSV Manager.

Got a question for us?
Just Ask!

Bookmark Ricebridge Java Components - user friendly and well documented at del.icio.us Digg Ricebridge Java Components - user friendly and well documented at Digg.com Bookmark Ricebridge Java Components - user friendly and well documented at reddit.com Bookmark Ricebridge Java Components - user friendly and well documented at YahooMyWeb Bookmark Ricebridge Java Components - user friendly and well documented at Spurl.net Bookmark Ricebridge Java Components - user friendly and well documented at Simpy.com Bookmark Polyphasic Mutants at NewsVine Blink this Ricebridge Java Components - user friendly and well documented at blinklist.com Bookmark Ricebridge Java Components - user friendly and well documented at Furl.net Fark Ricebridge Java Components - user friendly and well documented at Fark.com

CSV Formats

The CSV file format has no formal definition, despite its common use as a data exchange file format. The closest thing to a formal definition is probably RFC 4180, the MIME type for CSV files. The most practical definition is probably the Microsoft Excel CSV format (this is the default format of Ricebridge CSV Manager). Although CSV nominally stands for Comma Separated Values, even the comma is not mandatory, with tabs and colons also commonly used as field separation characters.

The General Case: DSV Format

It is more useful to consider the CSV format as a specific instance of a more general file format, which some have named DSV format, for Delimiter Separated Values. legitimate examples of DSV files then include all the variants that Excel recognises, but includes common UNIX formats such as the passwd file. The general case simply recognises that the fundamental structure is one of repeated fields within repeated rows, and that rows and fields are separated by designated characters.

Formal Definitions

For a more formal definition, Creatyvist Software have an excellent article on the finer points of the CSV format. The reference document included with this documentation also describes the formal grammar used by Ricebridge CSV Manager for parsing CSV files. On the surface CSV files seem fairly easy to parse, but there are number of issues that have to be dealt with before you can be sure of parsing any CSV file.

The critical problem with the CSV format is how to deal with commas and newlines (or any other separation characters) inside a data field. Since it is perfectly possible that data fields can contain any characters, especially for free format data such as names, addresses or short pieces of text, this is not an issue that can be ignored.

The most common way to deal with separators inside data fields, is to quote the data field. While this is not particularly problematic for commas, it does mean that you cannot parse a CSV file by simple breaking it up into input lines, as some newline characters may be inside quotes. Thus nearly all quick scripts and handwritten CSV parsers will fail in this case.

Another approach to dealing with separators is to escape them with a backslash or other character. This approach is more common on UNIX systems. While this does have the advantage that input lines does correspond directly to data rows, it still means that one cannot split data rows into data fields by simply splitting on commas, as these may in turn be escaped.

You can also add the various flavours of newline encoding to these parsing annoyances and compound that with the fact that external CSV files may combine various data escaping strategies, and you will find that CSV files are not as simple as they appear on the surface.

To help you handle any CSV file that you need to parse, CSV Manager provides you with a default set of common format variations, and also allows you to adjust each individual format variation so that you can get your data into the application. Defaults formats are provided for Excel, Mac files, passwd files, and general UNIX files.


Why bother hand-coding yet another CSV parser? Let us deal with the bugs. We make it simple and easy to focus on the real work; delivering your project on time.

Try the Online Demo Download Now for $170.00 More Licensing options Why should I buy this component?

If you need a site license, you can also download CSV Manager for $970.00 If you are a registered charity, academic institution, non-profit or government body, we can provide CSV Manager at zero cost.

How does CSV Manager load your data? We provide five options:

Need to load your data into a different data structure? Please tell us!

FREE Email Support

All CSV Manager licenses include a FREE 6 month Email Support Package, worth $1500.00.

Email Support Package

FREE XML Manager

All CSV Manager licenses include a FREE XML Manager single developer license, worth $170.00.

Convert CSV to XML

Any CSV Format

The CSV Manager component can handle any style of tabular text data, from Excel CSV files to the UNIX passwd file.

CSV Formats

"Your product actually did help a lot! I was able to accelerate my development time quite a bit and your service support was also real nice."
Luis Garcia,
Independent Contractor
"An event-based CSV parser is so much more flexible and elegant than the free alternatives that I found. We use it for applications which require a mixture of XML and CSV data. This requires a parser that performs well, uses memory efficiently, simplifies data access and accurately formats output data. We found the Ricebridge parser easy to work with, conspicuously well documented and capable of handling large (500kb+) files. That's a big thumbs up from all the Gaisan team."
Shane Dempsey,
Gaisan
"I would have spent all day trying to read CSV fields that contained commas. I bought your software about 15 minutes ago and my problem is ALREADY solved. Thank you very, very much for a quality piece of software."
Adrian Klingel,
Illumaware
"Ricebridge CSV Manager was really easy to integrate into our existing system, and has been completely reliable for data transfer, not to mention a big time saver!"
Fred Crowe, Speech-Writers
comment on this page Home | Search | About Us | Contact Us | Our Products | Documentation | Resources | Login
Copyright © 2004-2012 Ricebridge. All Rights Reserved.