comparing utf8s

Date view Thread view Subject view Author view

From: Godmar Back (gback@marker.cs.utah.edu)
Date: Sun Dec 20 1998 - 23:30:00 EST


Also, section 4.4.7 of the VM spec says:

    There are two differences between this format and the "standard"
    UTF-8 format. First, the null byte (byte)0 is encoded using the
    two-byte format rather than the one-byte format, so that Java Virtual
    Machine UTF-8 strings never have embedded nulls. Second, only the
    one-byte, two-byte, and three-byte formats are used. The Java Virtual
    Machine does not recognize the longer UTF-8 formats.

The way I read this is that Java's Utf8s are meant to be compared with
strcmp. The GET macro should not be necessary.

        - Godmar


Date view Thread view Subject view Author view

This archive was generated by hypermail 2b29 : Sat Sep 23 2000 - 19:57:26 EDT