Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Fix typos | Victor Uriarte | 2017-11-29 | 1 | -1/+1 |
| | |||||
* | Fix encoding logic/order | Victor Uriarte | 2017-01-10 | 1 | -5/+6 |
| | | | | | | If the user provides an encoding value, use it instead of trying to _guess_ first. If no value is provided, decode with the default of utf-8; if that fails, try unicode-escape. | ||||
* | Fix #315 support utf-8 by default | Tao Wang | 2017-01-11 | 1 | -1/+1 |
| | | | | Signed-off-by: Tao Wang <twang2218@gmail.com> | ||||
* | Correct license link (fixes #288). | Andi Albrecht | 2016-09-14 | 1 | -1/+1 |
| | |||||
* | Decode bytes to unicode in Lexer.get_tokens(). | Oleg Broytman | 2016-08-31 | 1 | -5/+16 |
| | | | | | | | Raise TypeError if the input is neither bytes in a known encoding nor unicode nor a file-like object (file, StringIO). Remove function u(). Add bytes_type to compat. Add tests for non-ascii. | ||||
* | Revert "Parse double dollars (PostgreSQL) as literal strings (fixes #277)." | Victor Uriarte | 2016-08-13 | 1 | -5/+2 |
| | | | | This reverts commit 2893bd1857d685cf892beac3a7429d03cf1a09f1. | ||||
* | Parse double dollars (PostgreSQL) as literal strings (fixes #277). | Andi Albrecht | 2016-08-13 | 1 | -2/+5 |
| | |||||
* | Move file_types list to compat module. | Andi Albrecht | 2016-08-10 | 1 | -9/+1 |
| | |||||
* | Fix a bug: recognize file (Python 2) as a stream | Oleg Broytman | 2016-08-06 | 1 | -1/+7 |
| | |||||
* | Fix parsing of streams (fixes #273). | Andi Albrecht | 2016-07-22 | 1 | -1/+3 |
| | |||||
* | Misc. small code clean-up/comments | Victor Uriarte | 2016-06-18 | 1 | -1/+1 |
| | |||||
* | Redo unicode-encoding on lexer.py | Victor Uriarte | 2016-06-11 | 1 | -11/+4 |
| | |||||
* | Allow re to compile once | Victor Uriarte | 2016-06-04 | 1 | -12/+6 |
| | | | | | Otherwise re will need to recompile every time a call to tokenize is made. | ||||
* | Simplify multi-line comments | Victor Uriarte | 2016-06-04 | 1 | -28/+6 |
| | |||||
* | Add or Update copyright year to files | Victor Uriarte | 2016-06-04 | 1 | -3/+3 |
| | |||||
* | Replace iter(range(len(...))) with enumerate | Victor Uriarte | 2016-06-02 | 1 | -5/+4 |
| | |||||
* | Pass encoding into lexer as a parameter | Victor Uriarte | 2016-06-02 | 1 | -7/+4 |
| | |||||
* | Simplify handling of encoding in lexer.py | Victor Uriarte | 2016-05-29 | 1 | -22/+10 |
| | |||||
* | Remove encoding guessing on lexer.py | Victor Uriarte | 2016-05-29 | 1 | -11/+1 |
| | |||||
* | Refactor lexer.tokens_unprocessed | Victor Uriarte | 2016-05-29 | 1 | -60/+27 |
| | | | | | | | Remove unused return pos. Remove redundant streamer func. Remove unreached else statement. Clean-up Lexer tokenization. | ||||
* | Bid Adieu to metaclass | Victor Uriarte | 2016-05-29 | 1 | -40/+17 |
| | | | | | | | | | | The singleton pattern isn't applicable since only one language is being implemented. Simplify Lexer initialization. Fix compat func `u` on Py3. Signature didn't match Py2. Feature isn't used yet. | ||||
* | Merge process_state and __call__ | Victor Uriarte | 2016-05-29 | 1 | -34/+16 |
| | | | | | | | | Other Changes: Only #pop state is used by multiline comments. Remove unused states. Simplify RegEx compilation. RegEx Exception too generic. | ||||
* | Merge process_tokendef and __call__ | Victor Uriarte | 2016-05-29 | 1 | -16/+6 |
| | | | | Remove unused token_variants | ||||
* | Remove assert statements from lexer.py | Victor Uriarte | 2016-05-29 | 1 | -19/+0 |
| | |||||
* | Remove undocumented ws handlers from lexer.py | Victor Uriarte | 2016-05-29 | 1 | -15/+2 |
| | | | | | Removing nl before tokenizing breaks comments (stripnl/stripall). Remove tab-expansion. | ||||
* | Remove unused func/class/args in lexer.py | Victor Uriarte | 2016-05-29 | 1 | -60/+3 |
| | | | | | | Remove unused apply_filters from lexer.py. Remove unused filters function from lexer.py. Remove unused arguments from get_tokens func. | ||||
* | Refactor var Lexer.tokens & func is_keyword | Victor Uriarte | 2016-05-29 | 1 | -68/+3 |
| | |||||
* | Don't call is_keyword for the new 'USING(' regex | koljonen | 2016-05-15 | 1 | -1/+1 |
| | |||||
* | Recognize USING as a keyword in 'USING(', not just in 'USING (' | koljonen | 2016-05-14 | 1 | -0/+1 |
| | | | | These were previously caught by (r'[^\W\d_]\w*(?=[.(])', tokens.Name), so I added a special regex just above that one. | ||||
* | fix issue #215 | Shen Longxing | 2015-12-18 | 1 | -1/+1 |
| | | | Add token for 'END WHILE' | ||||
* | Cleanup module code. | Andi Albrecht | 2015-10-26 | 1 | -1/+1 |
| | |||||
* | Fix tab expansion for Python 3. | Andi Albrecht | 2015-10-26 | 1 | -5/+7 |
| | |||||
* | Use compat module for single Python 2/3 code base. | Andi Albrecht | 2015-10-26 | 1 | -13/+16 |
| | | | | This change includes minor fixes and code cleanup too. | ||||
* | Recognize MSSQL temp tables and distinguish from MySQL comments (fixes #192). | Andi Albrecht | 2015-07-26 | 1 | -3/+3 |
| | |||||
* | Never interpret IN keyword as function name (fixes #183). | Andi Albrecht | 2015-04-12 | 1 | -0/+3 |
| | |||||
* | Fix other regex to identify comments starting with '#' character. | Andi Albrecht | 2015-03-16 | 1 | -1/+1 |
| | |||||
* | Support comments starting with '#' character (fixes #178). | Andi Albrecht | 2015-03-16 | 1 | -1/+1 |
| | |||||
* | get rid of tokens.Punctuation.ArrayIndex, add negative lookbehind for sqlite identifiers | Darik Gamble | 2015-03-04 | 1 | -2/+4 |
| | |||||
* | Allow identifiers to start with an underscore (fixes #175). | Andi Albrecht | 2015-03-01 | 1 | -2/+1 |
| | |||||
* | Recognize escaped backslashes within strings | casey | 2015-02-26 | 1 | -2/+1 |
| | | | | | Previously if a single quoted string ended with an escaped backslash, parsing would not consider the string to be terminated. | ||||
* | Fix of problem with multiline treated as stackable while /* /* */ is one comment, not two stacked | JacekPliszka | 2015-02-12 | 1 | -1/+7 |
| | |||||
* | (postgresql) Add support for square bracket array indexing, by darikg. | Andi Albrecht | 2015-02-08 | 1 | -0/+1 |
| | | | | Pull request #170 with trivial conflicts resolved. | ||||
* | Add double precision as a built-in datatype | Darik Gamble | 2015-02-06 | 1 | -0/+1 |
| | |||||
* | Fix a regression for identifiers with square brackets notation (fixes #153). | Andi Albrecht | 2014-12-01 | 1 | -1/+1 |
| | |||||
* | Handle modulo operator without spaces | Gavin Wahl | 2014-08-07 | 1 | -1/+2 |
| | | | | | | | `x %3` should be interpreted as a modulo operation, not a parameter marker. Co-authored-by: Rocky Meza <rmeza@fusionbox.com> | ||||
* | Fix incorrect parsing of string literals with line breaks (fixes #118). | Andi Albrecht | 2013-11-06 | 1 | -2/+2 |
| | |||||
* | Merge pull request #112 from alexander-beedie/master | Andi Albrecht | 2013-10-23 | 1 | -1/+1 |
|\ | | | | | extended JOIN token recognition/lexing (eg: FULL OUTER JOIN, NATURAL JOIN) | ||||
| * | tighten up JOIN regex | Alexander Beedie | 2013-10-17 | 1 | -1/+1 |
| | | | | | | ensure the regex does not match against invalid permutations such as 'LEFT OUTER CROSS JOIN', since CROSS/NATURAL keywords were added | ||||
| * | extend regex to allow recognition of 'NATURAL JOIN' | Alexander Beedie | 2013-10-17 | 1 | -1/+1 |
| | | |||||
| * | add keyword recognition for "FULL OUTER JOIN" and "CROSS JOIN" | Alexander Beedie | 2013-09-02 | 1 | -1/+1 |
| | |
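
The decoding order described in the 2017-01-10 entry ("Fix encoding logic/order") can be sketched roughly as follows. This is an illustrative reading of the commit message, not the code from the repository: the function name `decode_sql` is hypothetical, and the TypeError branch is an assumption based on the 2016-08-31 entry.

```python
def decode_sql(data, encoding=None):
    """Hypothetical helper illustrating the decoding order from the log.

    Not the project's actual implementation; names are assumptions.
    """
    # Text that is already unicode needs no decoding.
    if isinstance(data, str):
        return data

    # Assumption: anything that is neither text nor bytes is rejected,
    # in the spirit of the "Raise TypeError" entry from 2016-08-31.
    if not isinstance(data, bytes):
        raise TypeError("Expected text or bytes, got {!r}".format(type(data)))

    # An explicit encoding from the caller wins; no guessing first.
    if encoding:
        return data.decode(encoding)

    # No encoding given: default to utf-8, then fall back to unicode-escape.
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        return data.decode("unicode-escape")


print(decode_sql(b"select 1"))
print(decode_sql("caf\u00e9".encode("latin-1"), encoding="latin-1"))
```

Checking the caller-supplied encoding before any default keeps the behaviour predictable: input that happens to be valid utf-8 is not silently decoded with the wrong codec when the caller has said otherwise.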