summaryrefslogtreecommitdiff
path: root/ext/unicode
Commit message (Collapse)AuthorAgeFilesLines
...
* A lot of work on making TextIterator support propert codepoint-levelAndrei Zmievski2006-06-241-104/+183
| | | | | offsets and making it more robust in general.
* - Remove support for code units in TextIterator (people shouldn't beAndrei Zmievski2006-06-241-80/+32
| | | | | | | examining individual code units anyway) - Add offset() method. - Add optional locale parameter to the constructor.
* Implement user conversion error handler support. Works as normal errorAndrei Zmievski2006-06-211-2/+2
| | | | | | | | | handler, in that it can return false to make the default one take over. Handler signature is: user_handler($direction, $encoding, $char_byte, $offset, $message) Also removed support for using exceptions in default error handler.
* Implement unicode_set_error_handler() / unicode_restore_error_handler().Andrei Zmievski2006-06-201-1/+62
| | | | | The error handler doesn't do anything yet. (vaporware)
* Rename to str_transliterate().Andrei Zmievski2006-06-153-23/+3
|
* Fixed ZTS buildDmitry Stogov2006-06-151-2/+2
|
* transliterate()Andrei Zmievski2006-06-135-2/+153
|
* Add char_enum_types().Andrei Zmievski2006-05-093-2/+87
|
* Fix and adjust.Andrei Zmievski2006-05-091-11/+8
|
* Add char_enum_names().Andrei Zmievski2006-05-093-0/+120
|
* Been a long day..Andrei Zmievski2006-05-083-1/+58
|
* And going, and going...Andrei Zmievski2006-05-083-11/+78
|
* A few more property functinos.Andrei Zmievski2006-05-083-4/+91
|
* Added constants.c to the windows buildEdin Kadribasic2006-05-081-1/+1
|
* Adding property.c to windows buildFrank M. Kromann2006-05-051-1/+1
|
* Register slightly under half a metric ton of constants.Andrei Zmievski2006-05-054-3/+619
|
* Implement char_from_name().Andrei Zmievski2006-05-041-0/+29
|
* Some more work on property/names stuff.Andrei Zmievski2006-05-043-8/+113
|
* Fix locale functions naming problem.Andrei Zmievski2006-05-041-0/+7
|
* Change prefix to char_ and rename some functions.Andrei Zmievski2006-05-043-106/+109
|
* Some more property functions.Andrei Zmievski2006-05-033-3/+160
| | | | | # I am pondering a different prefix..
* *** empty log message ***Andrei Zmievski2006-05-031-0/+67
|
* Some additional binary property functions.Andrei Zmievski2006-05-022-0/+83
|
* FALSE on empty string.Andrei Zmievski2006-05-021-3/+8
|
* Implement C/POSIX migration functions.Andrei Zmievski2006-05-022-0/+100
|
* Add skeleton for character property file. Also remove some HAVE_UNICODEAndrei Zmievski2006-05-024-9/+32
| | | | | tests since it's non optional.
* Add collator_set_default().Andrei Zmievski2006-04-213-8/+31
|
* Hmm, ZEND_FENTRY() is the only one that allows flags to be added..Andrei Zmievski2006-04-211-0/+4
|
* Rename i18_loc_* to locale_*.Andrei Zmievski2006-04-213-9/+9
|
* Implement collator_get_default() and simplify/fix the underlying code.Andrei Zmievski2006-04-213-41/+31
| | | | | # Derick, objects aren't that difficult.. :)
* Move to refcounted implementation of collators.Andrei Zmievski2006-04-203-17/+39
|
* fix build on Win32Frank M. Kromann2006-04-201-1/+1
|
* Update protos.Andrei Zmievski2006-04-181-1/+9
|
* Another (and hopefully last) major streams commit.Sara Golemon2006-03-295-332/+2
| | | | | | | | | | | | | | | | | | | | | | | | | This moves unicode conversion to the filter layer (rather than at the lower streams layer) unicode_filter.c has been moved from ext/unicode to main/streams as it's an integral part of the streams unicode conversion process. There are now three ways to set encoding on a stream: (1) By context $ctx = stream_context_create(NULL,array('encoding'=>'latin1')); $fp = fopen('somefile', 'r+t', false, $ctx); (2) By stream_encoding() $fp = fopen('somefile', 'r+'); stream_encoding($fp, 'latin1'); (3) By filter $fp = fopen('somefile', 'r+'); stream_filter_append($fp, 'unicode.from.latin1', STREAM_FILTER_READ); stream_filter_append($fp, 'unicode.to.latin1', STREAM_FILTER_WRITE); Note: Methods 1 and 2 are convenience wrappers around method 3.
* Fix collator instantiation.Andrei Zmievski2006-03-281-4/+12
|
* Fix typos.Andrei Zmievski2006-03-281-2/+2
|
* Rewrite unicode_encode() and unicode_decode() functions. Apply the newAndrei Zmievski2006-03-271-47/+67
| | | | | conversion error semantics.
* Add unicode_get_error_mode() and unicode_get_subst_char().Andrei Zmievski2006-03-261-0/+39
|
* - Implemented basic collation support. For some reason "new Collator" gives ↵Derick Rethans2006-03-265-4/+304
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | segfaults when the object's collation resource is used. - The following example shows what is implemented: <?php $orig = $strings = array( 'côte', 'cote', 'côté', 'coté', 'fluße', 'flüße', ); echo "German phonebook:\n"; $c = collator_create( "de@collation=phonebook" ); foreach($c->sort($strings) as $string) { echo $string, "\n"; } echo $c->getAttribute(Collator::FRENCH_COLLATION) == Collator::ON ? "With" : "Without", " french accent sorting order\n"; echo "\nFrench with options:\n"; $c = collator_create( "fr" ); $c->setAttribute(Collator::CASE_FIRST, Collator::UPPER_FIRST); $c->setAttribute(Collator::CASE_LEVEL, Collator::ON); $c->setStrength(Collator::SECONDARY); foreach($c->sort($strings) as $string) { echo $string, "\n"; } echo $c->getAttribute(Collator::FRENCH_COLLATION) == Collator::ON ? "With" : "Without", " french accent sorting order\n"; ?>
* Implement to-Unicode conversion error behavior. Note the adjusted APIs.Andrei Zmievski2006-03-261-18/+8
|
* Add protos.Andrei Zmievski2006-03-261-4/+9
|
* * Remove unicode.from_error_mode and unicode.from_subst_char from INIAndrei Zmievski2006-03-261-5/+70
| | | | | | | settings. * Add unicode_set_error_mode() and unicode_set_subst_char() functions to manipulate these global settings.
* Use intern->type for break iterator.Andrei Zmievski2006-03-241-2/+2
|
* first check for NULL, then use the pointerAntony Dovgal2006-03-241-1/+2
|
* - Moved strtotitle to ext/standard and implemented the fallback case toDerick Rethans2006-03-223-64/+0
| | | | | | | | non-unicode with ucwords. There is also an implementation for unicode ucwords but that returns different results then strtotitle as it uppercases the first character of every word, and doesn't *titlecase* a word. The test case shows that.
* - Update windows file too (not tested, but should work).Derick Rethans2006-03-211-5/+2
|
* - Make ext/unicode an extension that is always there and can not be disabled.Derick Rethans2006-03-211-9/+3
|
* Refactor streams layer for PHP6.Sara Golemon2006-03-131-12/+12
| | | | | | | | | | Don't be frightened by the size of this commit. A significant portion of it is restoring the read buffer semantics back to what PHP4/5 use. (Or a close aproximation thereof). See main/streams/streams.c and ext/standard/file.c for a set of UTODO comments covering work yet to be done.
* Should use word break iteration instead of title, as title one has beenAndrei Zmievski2006-03-021-1/+1
| | | | | deprecated since Unicode 3.2>
* Nuke int32_t (everywhere except streams layer) and signed/unsigned warningsDmitry Stogov2006-03-021-2/+2
|