Maxime Coste
|
ed68d1ff28
|
utf8: use end of sequence iterators for more security
|
2014-07-05 12:10:06 +01:00 |
|
Maxime Coste
|
3f70d91f8c
|
Use unsigned char rather than char in utf8 decoding to avoid sign extension
|
2014-07-05 12:10:06 +01:00 |
|
Maxime Coste
|
db423e4a88
|
utf8::is_character_start takes directly the char value
|
2014-05-14 19:49:03 +01:00 |
|
Maxime Coste
|
2d96f853f8
|
Add utf8::codepoint_size function
|
2013-05-30 18:49:50 +02:00 |
|
Maxime Coste
|
270e950cf1
|
sort includes directives
|
2013-04-09 20:05:40 +02:00 |
|
Maxime Coste
|
5adee4a6a7
|
rename assert to kak_assert to avoid collisions
|
2013-04-09 20:04:11 +02:00 |
|
Maxime Coste
|
9f9ad58b39
|
utf8::dump uses a copy of the output iterator instead of a reference
|
2013-02-27 23:50:33 +01:00 |
|
Maxime Coste
|
7865223587
|
Add utf8::character_start function
|
2013-02-26 14:05:51 +01:00 |
|
Maxime Coste
|
ee882d9d02
|
utf8: use CharCount instead of size_t
|
2012-10-27 13:26:40 +02:00 |
|
Maxime Coste
|
df400f90ab
|
utf8: replace InvalidBytePolicy::Throw with InvalidBytePolicy::Assert
|
2012-10-17 17:01:51 +02:00 |
|
Maxime Coste
|
dfafcdb6e6
|
utf8::codepoint: configurable invalid byte policy
|
2012-10-13 19:05:14 +02:00 |
|
Maxime Coste
|
0ce6bd9bf5
|
use ByteCount instead of CharCount when we are really counting bytes
(that is most of the time when we are not concerned with displaying)
|
2012-10-11 00:41:48 +02:00 |
|
Maxime Coste
|
571861bc7b
|
Return something in utf8::distance, thanks again gcc for letting this work
|
2012-10-11 00:39:17 +02:00 |
|
Maxime Coste
|
ffba94fcde
|
Actually return something in utf8::codepoint, thanks gcc for using rax
|
2012-10-10 19:14:18 +02:00 |
|
Maxime Coste
|
7a8366da2b
|
add a unicode.hh header for Codepoint related functions, s/utf8::Codepoint/Codepoint/
|
2012-10-09 19:15:05 +02:00 |
|
Maxime Coste
|
1af7465107
|
utf8: add dump(OutputIterator& it, Codepoint cp)
|
2012-10-09 14:29:37 +02:00 |
|
Maxime Coste
|
2db1d02329
|
add utf8 helpers in utf8.hh
|
2012-10-08 14:25:05 +02:00 |
|