Helper tool to transliterate various encodings to latin. Attempts to
convert a character vector from its current encoding to "latin1" and -
if it fails - defaults back to the original term. This can be useful
for geocode and structured when attempting to
geocode terms containing symbols that photon does not support.
Usage
latinize(x, encoding = "latin1")
Arguments
- x
A character vector.
- encoding
Encoding that the strings in x should be
converted to. If the conversion fails, defaults back to the original
encoding. Defaults to "latin1".
Value
The transliterated vector of the same length as x. NAs
are avoided.
Examples
# converts fancy apostrophes to normal ones
latinize("Luatuanu\u2019u")
#> [1] "Luatuanu’u"
# does nothing
latinize("Berlin")
#> [1] "Berlin"
# also does nothing, although it would fail with `iconv`
latinize("\u0391\u03b8\u03ae\u03bd\u03b1")
#> [1] "Αθήνα"