Anklang 0.3.0-460-gc4ef46ba
ASE — Anklang Sound Engine (C++)

« « « Anklang Documentation
Loading...
Searching...
No Matches
Namespaces | Functions | Variables
unicode.hh File Reference
#include <ase/cxxaux.hh>

Go to the source code of this file.

Namespaces

namespace  Ase
 The Anklang C++ API namespace.
 

Functions

std::string Ase::utf8encode (const std::vector< uint32_t > &codepoints)
 Convert codepoints into an UTF-8 string, using the shortest possible encoding.
 
std::string Ase::utf8encode (const uint32_t *codepoints, size_t n_codepoints)
 Convert codepoints into an UTF-8 string, using the shortest possible encoding.
 
String Ase::string_to_ncname (const String &input, uint32_t substitute)
 
bool Ase::string_is_ncname (const String &input)
 
size_t Ase::utf8_to_unicode (const std::string &str, std::vector< uint32_t > &codepoints)
 
size_t Ase::utf8_to_unicode (const char *str, uint32_t *codepoints)
 
size_t Ase::utf8len (const std::string &str)
 Count valid UTF-8 sequences, invalid sequences are counted as Latin-1 characters.
 
size_t Ase::utf8len (const char *str)
 Count valid UTF-8 sequences, invalid sequences are counted as Latin-1 characters.
 
constexpr bool Ase::unicode_is_valid (uint32_t u)
 Return whether u is an allowed Unicode codepoint within 0x10FFFF and not part of a UTF-16 surrogate pair.
 
constexpr bool Ase::unicode_is_assigned (uint32_t u)
 Return whether u matches any of the assigned Unicode planes.
 
constexpr bool Ase::unicode_is_noncharacter (uint32_t u)
 Return whether u is one of the 66 Unicode noncharacters.
 
constexpr bool Ase::unicode_is_character (uint32_t u)
 Return whether u is not one of the 66 Unicode noncharacters.
 
constexpr bool Ase::unicode_is_control_code (uint32_t u)
 Return whether u is one of the 65 Unicode control codes.
 
constexpr bool Ase::unicode_is_private (uint32_t u)
 Return whether u is in one of the 3 private use areas of Unicode.
 
std::string Ase::encodefs (const std::string &fschars)
 Encode a file system path consisting of bytes into UTF-8, using surrogate code points to store non UTF-8 bytes.
 
std::string Ase::decodefs (const std::string &utf8str)
 Decode UTF-8 string back into file system path representation, extracting surrogate code points as bytes.
 
std::string Ase::displayfs (const std::string &utf8str)
 Convert UTF-8 encoded file system path into human readable display format, the conversion is lossy but readable.
 
std::vector< uint32_tAse::utf8decode (const std::string &utf8str)
 Convert valid UTF-8 sequences to Unicode codepoints, invalid sequences are treated as Latin-1 characters.
 

Variables

constexpr uint32_t Ase::unicode_last_codepoint