Package org.eclipse.jetty.util
Class Utf8Appendable
java.lang.Object
org.eclipse.jetty.util.Utf8Appendable
- All Implemented Interfaces:
CharsetStringBuilder
- Direct Known Subclasses:
Utf8StringBuffer
,Utf8StringBuilder
Utf8 Appendable abstract base class
This abstract class wraps a standard
Appendable
and provides methods to append UTF-8 encoded bytes, that are converted into characters.
This class is stateful and up to 4 calls to append(byte)
may be needed before state a character is appended to the string buffer.
The UTF-8 decoding is done by this class and no additional buffers or Readers are used. The UTF-8 code was inspired by
http://bjoern.hoehrmann.de/utf-8/decoder/dfa/
License information for Bjoern Hoehrmann's code:
Copyright (c) 2008-2009 Bjoern Hoehrmann <bjoern@hoehrmann.de>
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS
FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.-
Nested Class Summary
Nested classes/interfaces inherited from interface org.eclipse.jetty.util.CharsetStringBuilder
CharsetStringBuilder.DecoderStringBuilder, CharsetStringBuilder.Iso8859StringBuilder, CharsetStringBuilder.UsAsciiStringBuilder
-
Field Summary
Modifier and TypeFieldDescriptionprotected final Appendable
protected int
protected static final org.slf4j.Logger
static final char
static final byte[]
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionvoid
append
(byte b) void
append
(byte[] b) void
append
(byte[] b, int offset, int length) boolean
append
(byte[] b, int offset, int length, int maxChars) void
append
(char c) void
void
void
append
(ByteBuffer buf) void
appendByte
(byte b) void
abstract String
boolean
abstract int
length()
protected void
reset()
Take the partial string an reset in internal buffer, but retain partial code points.Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.eclipse.jetty.util.CharsetStringBuilder
takeString
-
Field Details
-
LOG
protected static final org.slf4j.Logger LOG -
REPLACEMENT
public static final char REPLACEMENT- See Also:
-
REPLACEMENT_UTF8
public static final byte[] REPLACEMENT_UTF8 -
_appendable
-
_state
protected int _state
-
-
Constructor Details
-
Utf8Appendable
-
-
Method Details
-
length
public abstract int length() -
reset
protected void reset() -
append
public void append(char c) -
append
-
append
-
append
public void append(byte b) - Specified by:
append
in interfaceCharsetStringBuilder
-
append
- Specified by:
append
in interfaceCharsetStringBuilder
-
append
public void append(byte[] b) - Specified by:
append
in interfaceCharsetStringBuilder
-
append
public void append(byte[] b, int offset, int length) - Specified by:
append
in interfaceCharsetStringBuilder
-
append
public boolean append(byte[] b, int offset, int length, int maxChars) -
appendByte
- Throws:
IOException
-
isUtf8SequenceComplete
public boolean isUtf8SequenceComplete() -
checkState
public void checkState() -
getPartialString
- Returns:
- The UTF8 so far decoded, ignoring partial code points
-
takePartialString
Take the partial string an reset in internal buffer, but retain partial code points.- Returns:
- The UTF8 so far decoded, ignoring partial code points
-
toReplacedString
-