Search code examples
phpregexutf-8special-charactersstring-conversion

PHP: Convert specific-Bosnian characters to non-bosnian (utf8 standard chars)


In Bosnia we have following characters only used in latin-form in Bosnia and Croatia, so I'd need to convert these letters as following:

FROM | TO
  ć  | c
  č  | c
  ž  | z
  š  | s
  đ  | dj

If this is possible with some special form of RegEx, or utf8_encode/decode, that informatiion and an appopriate example will be quite welcome! Thanks all.

PS - Want to achive this in PHP!


Solution

  • You can use this with iconv.

    $result = iconv("UTF-8", "ASCII//TRANSLIT", $text);
    

    That will work assuming your input $text is in utf-8. If it's in latin-1 then use

    iconv("ISO-8859-1", "ASCII//TRANSLIT", $text);
    

    Of cause your PHP must have iconv extension, most often iconv is enabled in php.ini file, but not always.