C#?Unicode編碼解碼如何實(shí)現(xiàn)

發(fā)布時間：2022-06-22 09:36:45 來源：億速云閱讀：514 作者：iii 欄目：開發(fā)技術(shù)

本文小編為大家詳細(xì)介紹“C# Unicode編碼解碼如何實(shí)現(xiàn)”，內(nèi)容詳細(xì)，步驟清晰，細(xì)節(jié)處理妥當(dāng)，希望這篇“C# Unicode編碼解碼如何實(shí)現(xiàn)”文章能幫助大家解決疑惑，下面跟著小編的思路慢慢深入，一起來學(xué)習(xí)新知識吧。

Unicode是計算機(jī)科學(xué)領(lǐng)域里的一項(xiàng)業(yè)界標(biāo)準(zhǔn),包括字符集、編碼方案等。Unicode 是為了解決傳統(tǒng)的字符編碼方案的局限而產(chǎn)生的，它為每種語言中的每個字符設(shè)定了統(tǒng)一并且唯一的二進(jìn)制編碼，以滿足跨語言、跨平臺進(jìn)行文本轉(zhuǎn)換、處理的要求。

在表示一個Unicode的字符時，通常會用“U+”然后緊接著一組十六進(jìn)制的數(shù)字來表示這一個字符。在基本多文種平面里的所有字符，要用四位十六進(jìn)制數(shù)；在零號平面以外的字符則需要使用五位或六位十六進(jìn)制數(shù)了。

string str = @"\u0005 \u0002\U00f3 \U +e9\u00e9";
string newStr = UnicodeDecode(str);
Console.WriteLine(newStr);
Console.WriteLine();

newStr = ToUnicode("0 - * @ ， 。 ？ 真的 繁體字");
Console.WriteLine(newStr);
Console.WriteLine();

正常字符轉(zhuǎn)換為unicode

        /// <summary>
        /// 對正常的字符串轉(zhuǎn)換為 Unicode 的字符串
        /// </summary>
        /// <param name="normalStr">正常的字符串</param>
        /// <param name="isIgnoreSpace">是否忽略空格符；默認(rèn) true 空格符不轉(zhuǎn)換；false 空格符要轉(zhuǎn)換</param>
        /// <param name="isUpperCaseU">是否大寫U字母 ‘\U'；默認(rèn) false ‘\u'</param>
        /// <returns></returns>
        public string ToUnicode(this string normalStr, bool isIgnoreSpace = true, bool isUpperCaseU = false)
        {
            if (string.IsNullOrEmpty(normalStr))
            {
                return string.Empty;
            }

            StringBuilder strResult = new StringBuilder();

            void func(int index)
            {
                if (isUpperCaseU)
                {
                    strResult.Append("\\U");
                }
                else
                {
                    strResult.Append("\\u");
                }
                strResult.Append(((int)normalStr[index]).ToString("x").PadLeft(4, '0'));
            }

            for (int i = 0; i < normalStr.Length; i++)
            {
                if (isIgnoreSpace)
                {
                    if (normalStr[i] == ' ')
                    {
                        strResult.Append(" ");
                    }
                    else
                    {
                        func(i);
                    }
                }
                else
                {
                    func(i);
                }
            }
            return strResult.ToString();
        }

解碼

        /// <summary>
        /// 對 Unicode 的字符串解碼
        /// </summary>
        /// <param name="unicodeStr">Unicode 字符串</param>
        /// <returns></returns>
        public string UnicodeDecode(string unicodeStr)
        {
            if (string.IsNullOrWhiteSpace(unicodeStr) || (!unicodeStr.Contains("\\u") && !unicodeStr.Contains("\\U")))
            {
                return unicodeStr;
            }

            string newStr = Regex.Replace(unicodeStr, @"\\[uU](.{4})", (m) =>
            {
                string unicode = m.Groups[1].Value;
                if (int.TryParse(unicode, System.Globalization.NumberStyles.HexNumber, null, out int temp))
                {
                    return ((char)temp).ToString();
                }
                else
                {
                    return m.Groups[0].Value;
                }
            }, RegexOptions.Singleline);

            return newStr;
        }

讀到這里，這篇“C# Unicode編碼解碼如何實(shí)現(xiàn)”文章已經(jīng)介紹完畢，想要掌握這篇文章的知識點(diǎn)還需要大家自己動手實(shí)踐使用過才能領(lǐng)會，如果想了解更多相關(guān)內(nèi)容的文章，歡迎關(guān)注億速云行業(yè)資訊頻道。

向AI問一下細(xì)節(jié)

C#?Unicode編碼解碼如何實(shí)現(xiàn)

猜你喜歡

最新資訊

相關(guān)推薦

相關(guān)標(biāo)簽