Re: [Scheme-reports] DISCUSSION/VOTE: The character tower Bear (06 May 2014 05:49 UTC)
Re: [Scheme-reports] DISCUSSION/VOTE: The character tower Sascha Ziemann (07 May 2014 08:16 UTC)
Re: [Scheme-reports] DISCUSSION/VOTE: The character tower Per Bothner (08 May 2014 01:35 UTC)
Re: [Scheme-reports] DISCUSSION/VOTE: The character tower Alaric Snell-Pym (08 May 2014 12:22 UTC)
Re: [Scheme-reports] DISCUSSION/VOTE: The character tower Jussi Piitulainen (08 May 2014 05:36 UTC)
Re: [Scheme-reports] DISCUSSION/VOTE: The character tower Shiro Kawai (06 May 2014 21:04 UTC)

Re: [Scheme-reports] DISCUSSION/VOTE: The character tower Bear 06 May 2014 05:44 UTC

On Mon, 2014-05-05 at 20:51 -0400, John Cowan wrote:

> 5) Should R7RS-large implementations be required to provide the characters
> from #\x80 to #\xFF?  (All Schemes in my test suite do so.)

Yes.

> 6) Should R7RS-large implementations be required to provide the
> characters from #\x100 to #\xFFFF, excluding the surrogate code points
> from #\xD800 to \#xD8FF, which do not correspond to Unicode scalar values?
> (JVM and CLR implementations other than Kawa do this.)  Voting yes on
> this question implies a yes vote on #5.

Yes, with the exception of code points which are not actually mapped to
any character by the Unicode standard.

> 7) Should R7RS-large implementations be required to
> provide the characters from #\x10000 to #\x10FFFF?  (R6RS
> implementations and many R5RS and R7RS implementations do this.
> See <http://trac.sacrideo.us/wg/wiki/UnicodeSupport> for details on
> particular implementations.)  Voting yes on this question implies a yes
> vote on #5 and #6.

No.

> 8) Should R7RS-large implementations be required to allow #\x0 in strings?
> (There have been implementations in the past which did not, for the sake
> of simpler interchange with C, but none of the test-suite implementations
> have this restriction.)

Abstention.  If an implementation is serious enough about Unicode
support to keep its strings in a Unicode normalized form, which ought
not be forbidden, then NUL can never appear in any string. If one wants
codepoint array semantics rather than string semantics, then one
probably wanted blobs instead of strings. OTOH, strings with codepoint
array semantics have a long history of use and it's hard to imagine
people abandoning them.  OTGH, strings containing NULs have been a
problem since inception for FFIs to languages that use NUL-terminated
strings.

> 9) Should R7RS-large implementations be required to allow the characters
> from #\x80 to #\xFF in strings?  (All implementations in the test suite
> do so.)

Yes.

> 10) Should R7RS-large implementations be required to allow the characters
> from #\x100 to #\xFFFF in strings?  (MIT and RScheme do not, even though
> they support them as character objects.)  Voting yes on this question
> implies a yes vote on #9.

Yes, with the exception of code points which are not actually mapped to
any character by the unicode standard and code points which have a
canonical decomposition (ie, the standard ought to allow an
implementation to implement strings as unicode normalized strings).

> 11) Should R7RS-large implementations be required to allow the characters
> from #\x10000 to #\x10FFFF in strings?  (Again, MIT does not, even though
> it supports them as character objects.)  Voting yes on this question
> implies a yes vote on #10.

No.

> 12) Should R7RS-large implementations be required to support identifiers
> with non-ASCII characters as specified in Section 7.1.1 of R7RS-large?
> (Most implementations do, either deliberately or because they support
> almost everything as an identifier.)  This permits the use of most
> languages as a source of Scheme identifiers.

Yes.  Identifiers which are distinct when in NFKC/NFKD normalized form
must be considered distinct by all implementations.  Identifiers which
are not distinct when normalized as NFD/NFK must _not_ be considered
distinct by any implementation.  The standard should give a definite
rule about identifiers which are distinct in NFD/NFK normalizations, but
identical in NFKC/NFKD normalizations; are they to be considered
distinct, considered identical, or is that implementation-defined?

> 13) Should R7RS-large implementations be required to provide the
> (scheme char) library, which is optional in R7RS-small?  It contains the
> procedures which require O(n)-sized tables, where n is the number of
> supported characters in the implementation, namely: char-alphabetic?,
> char-lower-case?, char-upper-case?, char-whitespace?, char-numeric?;
> char- and string-upcase, -downcase, and -foldcase; and the char-ci and
> string-ci procedures.  (Essentially all implementations do so, as all
> of these are required in R6RS and all but the foldcase procedure are
> required in R5RS.  The library was made optional in R7RS-small in order
> to support embedded implementations that wanted to provide the full
> range of characters but could not afford the space for tables.)

Yes.

			Bear

_______________________________________________
Scheme-reports mailing list
Scheme-reports@scheme-reports.org
http://lists.scheme-reports.org/cgi-bin/mailman/listinfo/scheme-reports