zoukankan      html  css  js  c++  java
  • [LeetCode] Encode and Decode Strings

    Problem Description:

    Design an algorithm to encode a list of strings to a string. The encoded string is then sent over the network and is decoded back to the original list of strings.

    Machine 1 (sender) has the function:

    string encode(vector<string> strs) {
      // ... your code
      return encoded_string;
    }

    Machine 2 (receiver) has the function:

    vector<string> decode(string s) {
      //... your code
      return strs;
    } 

    So Machine 1 does:

    string encoded_string = encode(strs);

    and Machine 2 does:

    vector<string> strs2 = decode(encoded_string);

    strs2 in Machine 2 should be the same as strs in Machine 1.

    Implement the encode and decode methods.

    Note:

    • The string may contain any possible characters out of 256 valid ascii characters. Your algorithm should be generalized enough to work on any possible characters.
    • Do not use class member/global/static variables to store states. Your encode and decode algorithms should be stateless.
    • Do not rely on any library method such as eval or serialize methods. You should implement your own encode/decode algorithm.

    Well, let's use an example strs = ["#$%", "", "12"] to illustrate the encoding and decoding algorithms. The idea is to use some character as sentinel. But since the input strs may contain any character, including the sentinel, we still need to use other information to avoid ambiguation. Specifically, we use the length of the string. The above string will be encoded as

    3##$%0#2#12

    Each color represents the encoding of each string and we use # as the sentinel.

    During decoding, we will initialize a starting point p to be 0. Then we find the first # starting from p, which is just the sentinel for the first string and characters between p and the first # encode the length of the following string, using which we would be able to extract that string. In the above example, the length of the first string is 3 and we extract 3 characters after the first sentinel # and get #$%, which is just the first string. Then we move p to the point after the first string and continue the above process. Finally, all strings will be extracted out.

    The code is as follows. If you find it not that clear, run it on the above example and you will get how it works.

     1 class Codec {
     2 public:
     3 
     4     // Encodes a list of strings to a single string.
     5     string encode(vector<string>& strs) {
     6         string s;
     7         for (string str : strs)
     8             s += to_string(str.length()) + '$' + str;
     9         return s;
    10     }
    11 
    12     // Decodes a single string to a list of strings.
    13     vector<string> decode(string s) {
    14         vector<string> strs;
    15         size_t n = s.length(), p = 0 ;
    16         while (p < n) {
    17             size_t pos = s.find('$', p);
    18             if (pos == string::npos) break;
    19             size_t sz = stoi(s.substr(p, pos - p));
    20             strs.push_back(s.substr(pos + 1, sz));
    21             p = pos + sz + 1;
    22         }
    23         return strs;
    24     }
    25 };
    26 
    27 // Your Codec object will be instantiated and called as such:
    28 // Codec codec;
    29 // codec.decode(codec.encode(strs));

    BTW, string::npos means the end of a string: if we reach the end of a string, that means there is no sentinel and all the strings have been extracted out, so we will return.

  • 相关阅读:
    阅读书籍推荐
    lvarchar类型对表结构变更影响
    离职总结 | 如何做个好员工?
    Windows7 general operation/cmd notes
    (转)CONST用法
    Linux多线程的使用一:互斥锁
    hadoop2.5.2学习及实践笔记(六)—— Hadoop文件系统及其java接口
    hadoop2.5.2学习及实践笔记(五)—— HDFS shell命令行常见操作
    hadoop2.5.2学习及实践笔记(四)—— namenode启动过程源码概览
    hadoop2.5.2学习及实践笔记(三)—— HDFS概念及体系结构
  • 原文地址:https://www.cnblogs.com/jcliBlogger/p/4768875.html
Copyright © 2011-2022 走看看