c 漢字轉拼音 GBK,GB2312

參考　

using system.text.regularexpressions;

//利用正規表示式驗證漢字和英文,阿拉伯數字

system.text.regularexpressions.regex reg = new system.text.regularexpressions.regex("^[\u4e00-\u9fa5]+$ || ^[a-za-z0-9]+$"); //驗證漢字和英文,阿拉伯數字

if (reg.match(textbox5.text.tostring()).success)}}

else

public static string getspell(string strtext) //自已寫的函式

return mystr;

}private static string getfirstspell(string cnchar) //自己寫的函式，下面的**是效率高的，缺點就是只支援gb2312，不支援gbk(有很多偏僻的字讀不出來)

;for (int i = 0; i < 26; i++));}

return "*";

}return cnchar;

}//下面這個支援gbk，但是效率不是很高

private static string getgbkx(string str)

//table of the constant list(以下是拼音的代表號)

// 'a'; //45217..45252

// 'b'; //45253..45760

// 'c'; //45761..46317

// 'd'; //46318..46825

// 'e'; //46826..47009

// 'f'; //47010..47296

// 'g'; //47297..47613

// 'h'; //47614..48118

// 'j'; //48119..49061

// 'k'; //49062..49323

// 'l'; //49324..49895

// 'm'; //49896..50370

// 'n'; //50371..50613

// 'o'; //50614..50621

// 'p'; //50622..50905

// 'q'; //50906..51386

// 'r'; //51387..51445

// 's'; //51446..52217

// 't'; //52218..52697

//沒有u,v

// 'w'; //52698..52979

// 'x'; //52980..53640

// 'y'; //53689..54480

// 'z'; //54481..55289

漢字轉拼音 gb2312 C

public class converthztopz gb2312 region gb2312中的漢字編碼 01 09區為特殊符號。16 55區為一級漢字，按拼音排序。56 87區為二級漢字，按部首筆畫排序。每個漢字及符號以兩個位元組來表示。第乙個位元組稱為高位位元組第二個位元組稱為低位位元...

GBK GB2312和UTF 8編碼區分

gbk包含全部中文字元，gbk的文字編碼是雙位元組來表示的，即不論中英文本元均使用雙位元組來表示，只不過為區分中文，將其最高位都定成1。至於utf 8編碼則是用以解決國際上字元的一種多位元組編碼，它對英文使用8位即乙個位元組中文使用24位三個位元組來編碼。對於英文本元較多的論壇則用ut...

C 漢字轉拼音

region 獲取漢字的全拼 getpinyin 和拼音首字母 getcodstring region 屬性資料定義,漢字的機內碼陣列,機內碼對應的拼音陣列漢字的機內碼陣列 private static int pyvalue newint 機內碼對應的拼音陣列 private static st...

c 漢字轉拼音 GBK,GB2312

漢字轉拼音 gb2312 C

GBK GB2312和UTF 8編碼區分

C 漢字轉拼音

相關推薦