Comments (1)
Can hack the load of colm programs to split ranges that cross the 0x7f boundary. Provides a nice quick-and-dirty fix. This hack modifies all such ranges, regardless of whether or not they are specified in hex.
Probably makes sense to switch to an unsigned character type. Ragel is signed by default because of its origins in C and the pervasiveness of the char type, but that probably doesn't make much sense in Colm.
If we don't switch to unsigned, then maybe the right solution is to implement this only for ranges specified in hex, although that adds an exception to the implementation that takes up some mental space.
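To make the problem concrete, here is a minimal sketch of how a hex range ends up inverted under a signed 8-bit alphabet. The helper `toSignedKey` is hypothetical and not part of colm; it only mimics reading a byte value as a signed key.

```cpp
#include <cassert>

/* Hypothetical helper, not colm code: map a byte value (0x00..0xff) to the
 * key it becomes under a signed 8-bit alphabet. */
static int toSignedKey( unsigned value )
{
	assert( value <= 0xff );
	/* Values above 0x7f wrap around to the negative half. */
	return value >= 0x80 ? static_cast<int>( value ) - 256 : static_cast<int>( value );
}

/* Mirrors the existing validation: a range is rejected when low > high. */
static bool rangeRejected( unsigned lowByte, unsigned highByte )
{
	return toSignedKey( lowByte ) > toSignedKey( highByte );
}
```

Under these assumptions a range written as 0x20..0xff becomes 32..-1, so the lower end compares greater than the upper end and the range is reported as an error, while 0x20..0x7f is unaffected.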
diff --git a/colm/parsetree.cc b/colm/parsetree.cc
index 572f061..5791973 100644
--- a/colm/parsetree.cc
+++ b/colm/parsetree.cc
@@ -1228,16 +1228,38 @@ FsmGraph *Range::walk( Compiler *pd )
 	delete lowerFsm;
 	delete upperFsm;
 
+	bool span0 = false;
+	if ( lowKey >= 0 && highKey < 0 )
+		span0 = true;
+
 	/* Validate the range. */
-	if ( lowKey > highKey ) {
+	if ( !span0 && lowKey > highKey ) {
 		/* Recover by setting upper to lower; */
 		error(lowerLit->loc) << "lower end of range is greater then upper end" << endl;
 		highKey = lowKey;
 	}
 
 	/* Return the range now that it is validated. */
-	FsmGraph *retFsm = new FsmGraph();
-	retFsm->rangeFsm( lowKey, highKey );
+	FsmGraph *retFsm = 0;
+	if ( span0 ) {
+		/* Range crosses the 0x7f boundary: split it into two pieces. */
+		FsmGraph *first = new FsmGraph();
+		FsmGraph *second = new FsmGraph();
+		Key k127 = 127;
+		Key kn128 = -128;
+		first->rangeFsm( lowKey, k127 );
+		second->rangeFsm( kn128, highKey );
+
+		first->unionOp( second );
+		first->minimizePartition2();
+		retFsm = first;
+	}
+	else {
+		/* Usual case. */
+		retFsm = new FsmGraph();
+		retFsm->rangeFsm( lowKey, highKey );
+	}
+
 	return retFsm;
 }
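The splitting logic in the patch can be tried in isolation with a rough standalone sketch like the one below. It assumes a signed 8-bit key space and uses plain integer pairs in place of `FsmGraph`; `KeyRange`, `splitRange`, `minKey` and `maxKey` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <utility>
#include <vector>

/* Hypothetical stand-in for a key range; colm builds an FsmGraph instead. */
typedef std::pair<int, int> KeyRange;

/* Assumed bounds of a signed 8-bit alphabet. */
static const int minKey = -128;
static const int maxKey = 127;

/* Return the pieces (each with low <= high) that cover the range.
 * An empty result means the range is invalid (low > high, no wrap). */
static std::vector<KeyRange> splitRange( int lowKey, int highKey )
{
	assert( lowKey >= minKey && lowKey <= maxKey );
	assert( highKey >= minKey && highKey <= maxKey );

	std::vector<KeyRange> parts;
	if ( lowKey >= 0 && highKey < 0 ) {
		/* Same test as span0 in the patch: the range crosses 0x7f. */
		parts.push_back( KeyRange( lowKey, maxKey ) );
		parts.push_back( KeyRange( minKey, highKey ) );
	}
	else if ( lowKey <= highKey ) {
		/* Usual case. */
		parts.push_back( KeyRange( lowKey, highKey ) );
	}
	return parts;
}
```

With these assumptions, `splitRange( 32, -1 )` yields the two pieces 32..127 and -128..-1. As in the patch, the test looks only at the signs of the keys, so it applies to every range with a non-negative lower end and a negative upper end, however the range was written.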