Fossil

Timeline

29 check-ins related to "invalid_utf8_table"

2016-06-17
00:04
merged from trunk (Closed-Leaf check-in: 8a877a7b user: sdr tags: invalid_utf8_table)
2016-06-16
22:14
shrunk size of lead byte table for invalid_utf8, and took a shortcut to invalidate lead bytes between 0x80 & 0xBF inclusive (check-in: 69328517 user: sdr tags: trunk)
17:01
more optimizations (all lead bytes between 0x80 & 0xBF are invalid, so use simple check for those, and also can shrink the invalid_utf8 lead byte table even more) (check-in: 6eb9a30c user: sdr tags: invalid_utf8_table)
12:14
More optimizations, taken over from trunk. (check-in: ec7f6b2e user: jan.nijtmans tags: invalid_utf8_table)
11:39
Minor further speed-up: Only increment pointer if really needed. (check-in: 5be2e9cf user: jan.nijtmans tags: trunk)
09:44
merge-mark (check-in: c22ec007 user: jan.nijtmans tags: invalid_utf8_table)
09:44
Further invalid_utf8() improvement: Save one indirection and a check, and make the table size even smaller. (check-in: 6a59dbbb user: jan.nijtmans tags: trunk)
09:14
Merge trunk (check-in: a3328c00 user: jan.nijtmans tags: invalid_utf8_table)
09:13
Use faster table-based approach when checking for invalid utf-8, instead of complex bit-operations. (check-in: 60349a66 user: jan.nijtmans tags: trunk)
08:04
A little bit faster, by only accessing *z once and using bit-operations instead of comparisons. (check-in: f246ea43 user: jan.nijtmans tags: invalid_utf8_table)
07:46
Unroll loop for even greater speed (check-in: cc09e003 user: jan.nijtmans tags: invalid_utf8_table)
2016-06-15
15:00
added a few comments (check-in: 63313a5f user: sdr tags: invalid_utf8_table)
08:19
Simplifications and constification (check-in: 6051c441 user: jan.nijtmans tags: invalid_utf8_table)
2016-06-14
18:08
merged from trunk (check-in: 12675ab7 user: sdr tags: invalid_utf8_table)
18:06
moved static table data out of invalid_utf8 and pre-initialized the table to avoid run time overhead; also shrunk the table by 50% by removing the ASCII code points (check-in: d3fc3772 user: sdr tags: invalid_utf8_table)
14:11
Forgot close tag in previous commit (check-in: 3b3310a7 user: jan.nijtmans tags: trunk)
05:00
merged from trunk (check-in: d22c72bc user: sdr tags: invalid_utf8_table)
2016-06-13
08:29
Fix handling of byte sequence \xF4\x00 in utf-8 validation. Now all tests pass again. Thanks to Scott Robison for discovering this! (check-in: 5684672a user: jan.nijtmans tags: trunk)
2016-06-11
05:23
Further coding style improvements for the new invalid_utf8() function. Also fixes several C99-isms. (check-in: 2fb7d59b user: mistachkin tags: invalid_utf8_table)
00:41
reformatted invalid_utf8 to make it conform a bit better to existing style (check-in: dd3bb22c user: sdr tags: invalid_utf8_table)
00:13
merged from trunk (check-in: 4f906e53 user: sdr tags: invalid_utf8_table)
00:11
fixed a bad function name; had tested it external to fossil but didn't build/test before committing; mea culpa (check-in: 2e7a6cb0 user: sdr tags: invalid_utf8_table)
00:06
added new tests for invalid utf8 sequences (check-in: 6da500a1 user: sdr tags: trunk)
2016-06-10
20:45
performance optimizations (check-in: 635f3b03 user: sdr tags: invalid_utf8_table)
11:52
Merge additional test-cases from trunk. All pass now. (check-in: 8a65d6f0 user: jan.nijtmans tags: invalid_utf8_table)
11:49
Add many test-cases of (possible) invalid UTF-8 sequences (check-in: acf7fe47 user: jan.nijtmans tags: trunk)
08:07
proposed new invalid_utf8 function (check-in: e58334a0 user: sdr tags: invalid_utf8_table)
2016-06-09
23:59
fixed an edge case in invalid_utf8 where a valid three byte seq could be misidentified (check-in: 314cdab0 user: sdr tags: invalid_utf8_table)
15:26
Add mimetypes for bz2 and bzip. (check-in: c7e9625d user: drh tags: trunk)
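
The check-in comments above describe a table-based check for invalid UTF-8: ASCII bytes are left out of the table, bytes 0x80 through 0xBF are rejected as lead bytes with a plain comparison, and a small pre-initialized table covers the remaining lead bytes. The following is a minimal sketch of that general idea, not the code from these check-ins. The names `sketch_invalid_utf8`, `lead_table`, and `classes` are hypothetical, the pointer-plus-length signature is an assumption, and the accepted byte ranges follow the strict well-formedness rules of the Unicode standard, which may differ from what Fossil accepts.

```c
#include <stddef.h>

/* Hypothetical sketch, not Fossil's actual invalid_utf8(). */

/* Per-class data: number of continuation bytes and the allowed
** range for the first continuation byte. */
struct lead_class { unsigned char ncont, lo, hi; };

static const struct lead_class classes[8] = {
  {0, 0x00, 0x00},  /* 0: never a valid lead byte (C0, C1, F5..FF) */
  {1, 0x80, 0xBF},  /* 1: C2..DF, two-byte sequence */
  {2, 0xA0, 0xBF},  /* 2: E0, excludes overlong forms */
  {2, 0x80, 0xBF},  /* 3: E1..EC, EE..EF */
  {2, 0x80, 0x9F},  /* 4: ED, excludes surrogates */
  {3, 0x90, 0xBF},  /* 5: F0, excludes overlong forms */
  {3, 0x80, 0xBF},  /* 6: F1..F3 */
  {3, 0x80, 0x8F},  /* 7: F4, excludes values above U+10FFFF */
};

/* Class index for lead bytes 0xC0..0xFF only. ASCII and 0x80..0xBF
** are handled by simple comparisons, so the table needs 64 entries. */
static const unsigned char lead_table[64] = {
  0,0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,   /* C0..CF */
  1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,   /* D0..DF */
  2,3,3,3,3,3,3,3,3,3,3,3,3,4,3,3,   /* E0..EF */
  5,6,6,6,7,0,0,0,0,0,0,0,0,0,0,0,   /* F0..FF */
};

/* Return 1 if the n bytes at z are not well-formed UTF-8, else 0. */
int sketch_invalid_utf8(const unsigned char *z, size_t n){
  size_t i = 0;
  while( i<n ){
    unsigned char c = z[i++];
    const struct lead_class *p;
    unsigned char k;
    if( c<0x80 ) continue;           /* ASCII: always valid */
    if( c<0xC0 ) return 1;           /* 0x80..0xBF cannot start a sequence */
    p = &classes[lead_table[c-0xC0]];
    if( p->ncont==0 ) return 1;      /* lead byte that is never valid */
    if( n-i<p->ncont ) return 1;     /* sequence truncated by end of input */
    if( z[i]<p->lo || z[i]>p->hi ) return 1;
    i++;
    for(k=1; k<p->ncont; k++, i++){  /* remaining continuation bytes */
      if( (z[i]&0xC0)!=0x80 ) return 1;
    }
  }
  return 0;
}
```

Because the first continuation byte is range-checked against the table entry, a sequence such as \xF4\x00 is rejected at the second byte rather than being treated as a valid lead followed by an arbitrary byte.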