Struct encoding::codec::utf_8::UTF8Encoding
source · pub struct UTF8Encoding;
Expand description
UTF-8 (UCS Transformation Format, 8-bit).
This is a Unicode encoding compatible to ASCII (ISO/IEC 646:US) and able to represent all Unicode codepoints uniquely and unambiguously. It has a variable-length design, where one codepoint may use 1 (up to U+007F), 2 (up to U+07FF), 3 (up to U+FFFF) and 4 bytes (up to U+10FFFF) depending on its value. The first byte of the sequence is distinct from other “continuation” bytes of the sequence making UTF-8 self-synchronizable and easy to handle. It has a fixed endianness, and can be lexicographically sorted by codepoints.
The UTF-8 scanner used by this module is heavily based on Bjoern Hoehrmann’s Flexible and Economical UTF-8 Decoder.
Trait Implementations§
source§impl Clone for UTF8Encoding
impl Clone for UTF8Encoding
source§fn clone(&self) -> UTF8Encoding
fn clone(&self) -> UTF8Encoding
Returns a copy of the value. Read more
1.0.0 · source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source
. Read moresource§impl Encoding for UTF8Encoding
impl Encoding for UTF8Encoding
source§fn name(&self) -> &'static str
fn name(&self) -> &'static str
Returns the canonical name of given encoding.
This name is guaranteed to be unique across built-in encodings,
but it is not normative and would be at most arbitrary.
source§fn whatwg_name(&self) -> Option<&'static str>
fn whatwg_name(&self) -> Option<&'static str>
Returns a name of given encoding defined in the WHATWG Encoding standard, if any.
This name often differs from
name
due to the compatibility reason.source§fn raw_encoder(&self) -> Box<dyn RawEncoder>
fn raw_encoder(&self) -> Box<dyn RawEncoder>
Creates a new encoder.
source§fn raw_decoder(&self) -> Box<dyn RawDecoder>
fn raw_decoder(&self) -> Box<dyn RawDecoder>
Creates a new decoder.
source§fn encode(
&self,
input: &str,
trap: EncoderTrap
) -> Result<Vec<u8>, Cow<'static, str>>
fn encode( &self, input: &str, trap: EncoderTrap ) -> Result<Vec<u8>, Cow<'static, str>>
An easy-to-use interface to
RawEncoder
.
On the encoder error trap
is called,
which may return a replacement sequence to continue processing,
or a failure to return the error.source§fn encode_to(
&self,
input: &str,
trap: EncoderTrap,
ret: &mut dyn ByteWriter
) -> Result<(), Cow<'static, str>>
fn encode_to( &self, input: &str, trap: EncoderTrap, ret: &mut dyn ByteWriter ) -> Result<(), Cow<'static, str>>
Encode into a
ByteWriter
.