Replace accented characters

Database

Replace accented characters

Hi all,

 

I need to remove any accented character, find the following list (http://stackoverflow.com/questions/3371697/replacing-accented-characters-php):

a - àáâãåæāăąǎǻаא
ae - ä æ ǽ
b - þбב
c - ç ć ĉ ċ č ц ץ צ
ch - ч
d - ð ď đ д ד
e - è é ê ë ē ĕ ė ę ě ə е є ע
f - ƒ ф
g - ĝ ğ ġ ģ г ґ ג
h - ĥ ħ х ה ח
i - ì í î ï ĩ ī ĭ į İ ı ǐ и і ї י
ij - ij
j - ĵ й
ja - я
je - э
jo - ё
ju - ю
k - ķ ĸ к ך כ
l - ĺ ļ ľ ŀ ł л ל
m - м ם מ
n - ñ ń ņ ň ʼn ŋ н ן נ
o - ò ó ô õ ø ō ŏ ő ơ ǒ ǿ о
oe - ö œ
p - п ף פ
q - ק
r - ŕ ŗ ř р ר
s - ś ŝ ş š ș с ס
sch - щ
sh - ш
ss - ß
t - ţ ť ŧ ț т ט ת
tm - ™
u - ù ú û ũ ū ŭ ů ű ų ư ǔ ǖ ǘ ǚ ǜ у
ue - ü
v - в ו
w - ŵ ש
y - ý ÿ ŷ ы
z - ź ż ž ſ з ז
zh - ж

 

I've thought of using nested regex (REGEXP_REPLACE('source_string','[àáâãåæāăąǎǻаא]','a',1,0,'i'),...) to replace accented characters, but I'm not sure it's the best performance option.

 

Does anyone know any other method?

Thank you,

 

Regards,
Misgate.

 

  • accented
  • character
  • convert
  • replace
  • utf-8
1 REPLY
Senior Apprentice

Re: Replace accented characters

For all those single characters you could do a huge oTranslate (don't forget to add both upper & lower case), but you still need a dozen nested RegExp_Replace for the remaining. If you need to run this against huge amouts of text it's probably much more efficient to code a C-UDF.