8. String to Integer (atoi)

  • Hardness: \(\color{orange}\textsf{Medium}\)
  • Ralated Topics: String

一、題目

Implement the myAtoi(string s) function, which converts a string to a 32-bit signed integer (similar to C/C++’s atoi function).
The algorithm for myAtoi(string s) is as follows:

  1. Read in and ignore any leading whitespace.
  2. Check if the next character (if not already at the end of the string) is '-' or '+'. Read this character in if it is either. This determines if the final result is negative or positive respectively. Assume the result is positive if neither is present.
  3. Read in next the characters until the next non-digit character or the end of the input is reached. The rest of the string is ignored.
  4. Convert these digits into an integer (i.e. "123" -> 123, "0032" -> 32). If no digits were read, then the integer is 0. Change the sign as necessary (from step 2).
  5. If the integer is out of the 32-bit signed integer range [-2^31, 2^31-1], then clamp the integer so that it remains in the range. Specifically, integers less than -2^31 should be clamped to -2^31, and integer greater than 2^31-1 should be clamped to 2^31-1.
  6. Return the integer as the final result.

Note:

  • Only the space character ' ' is considered a whitespace character.
  • Do not ignore any characters other than the leading whitespace or the rest of the string after the digits.

Example 1:

  • Input: s = “42”
  • Output: 42
  • Explanation: The underlined characters are what is read in, the caret is the current reader position.
    Step 1: “42” (no characters read because there is no leading whitespace)
    Step 2: “42” (no characters read because there is neither a ‘-’ nor ‘+’)
    Step 3: “42” (“42” is read in)
    The parsed integer is 42.
    Since 42 is in the range [-2^31, 2^31 - 1], the final result is 42.

Example 2:

  • Input: s = " -42"
  • Output: -42
  • Explanation:
    Step 1: " -42" (leading whitespace is read and ignored)
    Step 2: " -42" (’-’ is read, so the result should be negative)
    Step 3: " -42" (“42” is read in)
    The parsed integer is -42.
    Since -42 is in the range [-2^31, 2^31 - 1], the final result is -42.

Example 3:

  • Input: s = “4193 with words”
  • Output: 4193
  • Explanation:
    Step 1: “4193 with words” (no characters read because there is no leading whitespace)
    Step 2: “4193 with words” (no characters read because there is neither a ‘-’ nor ‘+’)
    Step 3: “4193 with words” (“4193” is read in; reading stops because the next character is a non-digit)
    The parsed integer is 4193.
    Since 4193 is in the range [-2^31, 2^31 - 1], the final result is 4193.

Constraints:

  • 0 <= s.length <= 200
  • s consists of English letters (lower-case and upper-case), digits (0-9), ' ', '+', '-', and '.'.

二、分析

  • 此題只需根據條件一一去排除:
    1. 處理 leading spaces
    2. 判斷 s[0]
    3. 處理超過範圍 [-2^31, 2^31-1] 的數
    4. 處理終止條件
  • 可代入幾個處理字串常用的自訂函式:
    • 判斷是否為英文字母 isLetter()
    bool isLetter(char c) {
        return (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z');
    }
    
    • 判斷是否為數字 isDigit()
    bool isDigit(char c) {
        return c >= '0' && c <= '9';
    }
    
    • 去除左邊空白、右邊空白 ltrim()rtrim()
    void ltrim(string& s) {
        s.erase(
            s.begin(),
            find_if(s.begin(), s.end(), not1(ptr_fun<int, int>(isspace)))
        );
    }
    void rtrim(string& s) {
        s.erase(
            find_if(s.rbegin(), s.rend(), not1(ptr_fun<int, int>(isspace))).base(),
            s.end()
    
        );
    }
    
    • 分隔 split()
    vector<string> split(string& s, char del) {
        stringstream ss(str);
        string item;
        vector<string> res;
        while (getline(ss, item, del)) 
            if (!item.empty()) res.push_back(item);
        return res;
    }
    
    • 合併 concat()
    string concat(vector<string>& svec, char del) {
        stringstream ss;
        for (const auto& s : svec) ss << s << del;
        return ss.str();
    }
    

三、解題

1. String

  • Time complexity: \(O(n)\)
  • Space complexity: \(O(1)\)
// 判斷是否為英文字母
bool isLetter(char c) {
    return (c >= 'a' && c <= 'z') || (c >= 'A' && c <= 'Z');
}
// 判斷是否為數字
bool isDigit(char c) {
    return c >= '0' && c <= '9';
}
// 去除左邊空白
void ltrim(string& s) {
    s.erase(
        s.begin(),
        find_if(s.begin(), s.end(), not1(ptr_fun<int, int>(isspace)))
    );
}
int myAtoi(string s) {
    int res = 0;
    
    // 去除左邊空白
    ltrim(s);

    // 判斷 s[0],取正負值
    bool neg = false;
    if (s[0] == '-')
        neg = true;
    else if (isEnglish(s[0]) || s[0] == '.')
        return 0;
    else if (isDigit(s[0]))
        res += (s[0] - '0');
    
    for (int i = 1; i < s.length(); i++) {
        // 終止條件:不再是數字
        if (!isDigit(s[i])) break;  
        
        // 超出數字範圍
        if (neg && (res < INT_MIN/10 || (res == INT_MIN/10 && s[i] == '9'))) return INT_MIN;
        else if (!neg && (res > INT_MAX/10 || (res == INT_MAX/10 && s[i] >= '8'))) return INT_MAX;
        
        // 累計數值
        if (neg) 
            res = 10 * res - (s[i] - '0');
        else
            res = 10 * res + (s[i] - '0');
    }
    return res;
    
}

回目錄 Catalog