Comments (2)
Any updates on this issue?
Regards,
from fast-cpp-csv-parser.
Hi,
this problem already appeared previously and I decided to not fix it at that
time. The reasons were:
- CSV files that contain newlines within a line that do not separate lines are
defective in my opinion. The first step should consist of fixing the file.
- Splitting up the lines is done in LineReader. Unescaping quotes is done in
CSVReader but only if the right policies are set. LineReader knows nothing
about quotes and thus cannot know whether a newline is within quotes or not.
The whole quoting logic is contained within a CSVReader policy. I therefore
see no straightforward way of making this "just work" without major
interface-breaking changes to the design.
- The most frequent use case consists of parsing CSV files that only contain
numbers and perhaps non-escaped strings. I do not want to add logic to
LineReader that makes this use case slower.
- Windows vs Linux newlines, i.e., "\r\n" vs "\n", make this whole problem
significantly worse. Note that these two can appear simultaneously within
one file. Sidenote: This whole Linux vs Windows issue is also the reason why
the newline characters are hardcoded within LineReader and not part of a
template parameter, as they should be in an ideal world.
If you know of a good way to support such ill-formed CSV files, then I can add
the change. If not, then this problem will stay.
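For readers who want to follow the "fix the file first" route, here is a minimal sketch of a standalone preprocessing step. It is not part of fast-cpp-csv-parser. The function name flatten_quoted_newlines and its replacement parameter are hypothetical, and the sketch assumes that '"' is the quote character and that a literal quote inside a field is written as "". It copies the input and replaces every line break found inside a quoted field with a single character, treating "\r\n" and "\n" alike.

```cpp
#include <istream>
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical helper, NOT part of fast-cpp-csv-parser.
// Copies CSV text from `in` to `out` and replaces each line break that
// occurs inside a double-quoted field with `replacement`, so that every
// record ends up on exactly one physical line.
// Assumption: '"' is the quote character and "" is an escaped quote.
void flatten_quoted_newlines(std::istream& in, std::ostream& out,
                             char replacement = ' ') {
    bool in_quotes = false;
    char c;
    while (in.get(c)) {
        if (c == '"') {
            // An escaped quote ("") toggles the state twice, which leaves
            // it unchanged, so no special handling is needed.
            in_quotes = !in_quotes;
            out.put(c);
        } else if (in_quotes && (c == '\r' || c == '\n')) {
            // Treat "\r\n" as one line break: swallow the '\n' after '\r'.
            if (c == '\r' && in.peek() == '\n') {
                in.get(c);
            }
            out.put(replacement);
        } else {
            // Everything outside quotes, including real record
            // separators, is copied through untouched.
            out.put(c);
        }
    }
}
```

The cleaned output can be written to a new file and then parsed as usual. Keeping this step outside the parser leaves the common numbers-only case as fast as before. The cost is that the original line breaks inside fields are lost unless a distinguishable replacement character is chosen.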
Best Regards,
Ben Strasser
On Sat, 06 Jun 2015 12:06:31 -0700
Niranjan Godbole [email protected] wrote:
Any updates on this issue?
Regards,