summaryrefslogtreecommitdiff
path: root/contrib/file_fdw
diff options
context:
space:
mode:
authorTom Lane <tgl@sss.pgh.pa.us>2022-08-02 10:29:35 -0400
committerTom Lane <tgl@sss.pgh.pa.us>2022-08-02 10:29:35 -0400
commitc034b629cc6f44099c9f54f3f0b3f4340e02d9bc (patch)
tree2c2a6bfda8f57cef756f3db06d0e96d6b6521395 /contrib/file_fdw
parent5b94d3ccb7ad9be902c37505ed54aabd2aeeccf1 (diff)
Change type "char"'s I/O format for non-ASCII characters.
Previously, a byte with the high bit set was just transmitted as-is by charin() and charout(). This is problematic if the database encoding is multibyte, because the result of charout() won't be validly encoded, which breaks various stuff that expects all text strings to be validly encoded. We've previously decided to enforce encoding validity rather than try to individually harden each place that might have a problem with such strings, so it's time to do something about "char". To fix, represent high-bit-set characters as \ooo (backslash and three octal digits), following the ancient "escape" format for bytea. charin() will continue to accept the old way as well, though that is only reachable in single-byte encodings. Add some test cases just so there is coverage for this code. We'll otherwise leave this question undocumented as it was before, because we don't really want to encourage end-user use of "char". For the moment, back-patch into v15 so that this change appears in 15beta3. If there's not great pushback we should consider absorbing this change into the older branches. Discussion: https://postgr.es/m/2318797.1638558730@sss.pgh.pa.us
Diffstat (limited to 'contrib/file_fdw')
0 files changed, 0 insertions, 0 deletions