Replace $number with $number+1 in std::string

Question 1

I want to detect $number substring in std::string and then replace it with $number + 1.

For example, the string Hello9ドルWorld1ドル should become Hello10ドルWorld2ドル.

Here's my code:

#include <iostream>
#include <string>
void modifyDollarNumber(std::string &str)
{
 for (size_t i = str.length(); i --> 0 ;)
 {
 if (str[i] == '$')
 {
 size_t j = i + 1;
 while (j < str.length() && isdigit(str[j]))
 {
 ++j;
 }
 size_t len = j - (i + 1);
 if (len)
 {
 std::string sub = str.substr(i + 1, len);
 int num = std::stoi(sub) + 1;
 str.erase(i + 1, len);
 sub = std::to_string(num);
 str.insert(i + 1, sub);
 }
 }
 }
}
int main()
{
 std::string str = "!$@#34ドル1ドル%^&5ドル*1ドル$!%91ドル12ドル@3ドル";
 modifyDollarNumber(str);
 std::cout << "Result : " << str << '\n';
}

And I can get the result I want which is

Result : !$@#35ドル2ドル%^&6ドル*2ドル$!%92ドル13ドル@4ドル
Program ended with exit code: 0

But I want to improve my code so it can be as fast as possible.

How can I simplify modifyDollarNumber() function?

Question 2

'fast' is generally within a particular usage; does it need to be fast for lots of small strings like the example, or long strings with many numbers...?

Question 3

What's the range of the numbers to be replaced? Can they be negative?

Question 4

@PhilH I will be processing a long string with many $numbers

Question 5

@TobySpeight They can’t be negative so they should be larger than 0 and should be integers only.

Question 6

Good - I've used an unsigned type in my answer, on that assumption.

Question 7

Few observations.

I. You don't need to scan the whole string, character-by-character. str.find('$') will do it for you (and remember to start the second search, and all subsequent ones, from the last reference detected.)

II. Rather than taking substrings, parsing them, erasing and inserting—why not increment in-place? You know exactly your number span (from the one past '$' to the one past the last digit.) If this string only consists of '9's, insert a single '1' right after '$' and change all nines into zeroes. If there are digits less than nine, increment the rightmost non-nine, and change all the subsequent nines into zeroes, if any, no insertion/erasure required. Proceed with the next search.

III. As @AJNeufeld mentioned, string reconstruction can be done in a separate buffer, though vast testing is needed to decide if this is a faster option.

Question 8

thanks for your tips. regarding your second tip, i’m looking for a solution that is easily customizable. for example, it should be easy to change from $(number + 1) to $(number * 2) later.

Question 9

You've misspelt std::size_t throughout, and also std::isdigit (which is missing the necessary include of <cctype> - note also that passing plain char to the character classification functions is risky - cast to unsigned char first).

The in-place modification of your string involves copying increasing parts of it multiple times (even when the replacement string is of the same length). You can avoid that quite simply by using std::string::replace() instead of erase()+insert():

 std::string sub = str.substr(i + 1, len);
 int num = std::stoi(sub) + 1;
 str.replace(i + 1, len, std::to_string(num));

This still leaves a lot of copying when the increment adds a digit (9, 99, 999, ...) - I think your test-case should include at least one of each. To avoid that problem (and to make the usage more intuitive to the caller), it may be better to write a function that returns a copy of the string (so accept it by const reference):

#include <algorithm>
#include <cctype>
#include <string>
std::string modifyDollarNumber(const std::string& str)
{
 std::string result;
 result.reserve(str.length());
 auto out = std::back_inserter(result);
 auto pos = str.cbegin();
 while (pos != str.cend()) {
 auto dollar_pos = std::find(pos, str.cend(), '$');
 std::copy(pos, dollar_pos, out);
 // no more substitutions?
 if (dollar_pos == str.cend()) { break; }
 // copy the dollar sign
 result += '$';
 pos = dollar_pos + 1;
 // is it followed by a number?
 auto digit_end = std::find_if(pos, str.end(),
 [](unsigned char c){ return !std::isdigit(c); });
 if (digit_end == pos) { continue; }
 // copy the incremented number
 auto num = std::stoul(std::string{pos, digit_end});
 result.append(std::to_string(num+1));
 pos = digit_end;
 }
 return result;
}

#include <iostream>
int main()
{
 const std::string str = "1ドル $-22 027ドル $$ $";
 std::cout << "Result : " << modifyDollarNumber(str) << '\n';
}

But if raw speed is more important than readability, you'll need to benchmark with some representative inputs to see which is best for you.

Question 10

is it problematic to write simply ‘size_t’ and ‘isdigit()’ even if it compiles?

Question 11

Yes - even though the compiler you're using right now defines these symbols in the global namespace as well as in std, there's no guarantee that it will continue to do so, and it almost certainly won't be compilable by at least some people who want to use your code. Stick to what's guaranteed in the standard, and your code will be much more portable (and therefore useful).

Question 12

Of course, if you really want to miss off the std prefix, you can using std::size_t; early in your function. But I don't generally recommend that unless you're using it on almost every line.

Question 13

String manipulation can be slow. Inserting and deleting characters requires shifting characters in memory, if modifying in place, or allocating and freeing temporaries.

It could be faster to allocate one destination character buffer (char[] or wchar_t[]), and copy characters into it, performing the translations as required, and then converting into a std::string at the end.

The destination buffer would need space for str.length() + std::count(str.begin(), str.end(), '$') characters, since 9ドル can become 10ドル, etc.

Question 14

Note that using a fresh string as buffer is approx. as effective.

Question 15

I would appreciate a working example if possible.

bipll bipll 9984 silver badges7 bronze badges · Answer 1 · 2018-08-02 15:08:27Z

Few observations.

I. You don't need to scan the whole string, character-by-character. str.find('$') will do it for you (and remember to start the second search, and all subsequent ones, from the last reference detected.)

II. Rather than taking substrings, parsing them, erasing and inserting—why not increment in-place? You know exactly your number span (from the one past '$' to the one past the last digit.) If this string only consists of '9's, insert a single '1' right after '$' and change all nines into zeroes. If there are digits less than nine, increment the rightmost non-nine, and change all the subsequent nines into zeroes, if any, no insertion/erasure required. Proceed with the next search.

III. As @AJNeufeld mentioned, string reconstruction can be done in a separate buffer, though vast testing is needed to decide if this is a faster option.

thanks for your tips. regarding your second tip, i’m looking for a solution that is easily customizable. for example, it should be easy to change from $(number + 1) to $(number * 2) later.

Toby Speight Toby Speight 87.5k14 gold badges104 silver badges323 bronze badges · Answer 2 · 2018-08-02 15:14:06Z

You've misspelt std::size_t throughout, and also std::isdigit (which is missing the necessary include of <cctype> - note also that passing plain char to the character classification functions is risky - cast to unsigned char first).

The in-place modification of your string involves copying increasing parts of it multiple times (even when the replacement string is of the same length). You can avoid that quite simply by using std::string::replace() instead of erase()+insert():

 std::string sub = str.substr(i + 1, len);
 int num = std::stoi(sub) + 1;
 str.replace(i + 1, len, std::to_string(num));

This still leaves a lot of copying when the increment adds a digit (9, 99, 999, ...) - I think your test-case should include at least one of each. To avoid that problem (and to make the usage more intuitive to the caller), it may be better to write a function that returns a copy of the string (so accept it by const reference):

#include <algorithm>
#include <cctype>
#include <string>
std::string modifyDollarNumber(const std::string& str)
{
 std::string result;
 result.reserve(str.length());
 auto out = std::back_inserter(result);
 auto pos = str.cbegin();
 while (pos != str.cend()) {
 auto dollar_pos = std::find(pos, str.cend(), '$');
 std::copy(pos, dollar_pos, out);
 // no more substitutions?
 if (dollar_pos == str.cend()) { break; }
 // copy the dollar sign
 result += '$';
 pos = dollar_pos + 1;
 // is it followed by a number?
 auto digit_end = std::find_if(pos, str.end(),
 [](unsigned char c){ return !std::isdigit(c); });
 if (digit_end == pos) { continue; }
 // copy the incremented number
 auto num = std::stoul(std::string{pos, digit_end});
 result.append(std::to_string(num+1));
 pos = digit_end;
 }
 return result;
}

#include <iostream>
int main()
{
 const std::string str = "1ドル $-22 027ドル $$ $";
 std::cout << "Result : " << modifyDollarNumber(str) << '\n';
}

But if raw speed is more important than readability, you'll need to benchmark with some representative inputs to see which is best for you.

is it problematic to write simply ‘size_t’ and ‘isdigit()’ even if it compiles?
Yes - even though the compiler you're using right now defines these symbols in the global namespace as well as in std, there's no guarantee that it will continue to do so, and it almost certainly won't be compilable by at least some people who want to use your code. Stick to what's guaranteed in the standard, and your code will be much more portable (and therefore useful).
Of course, if you really want to miss off the std prefix, you can using std::size_t; early in your function. But I don't generally recommend that unless you're using it on almost every line.

AJNeufeld AJNeufeld 35.2k5 gold badges41 silver badges103 bronze badges · Answer 3 · 2018-08-02 14:27:59Z

String manipulation can be slow. Inserting and deleting characters requires shifting characters in memory, if modifying in place, or allocating and freeing temporaries.

It could be faster to allocate one destination character buffer (char[] or wchar_t[]), and copy characters into it, performing the translations as required, and then converting into a std::string at the end.

The destination buffer would need space for str.length() + std::count(str.begin(), str.end(), '$') characters, since 9ドル can become 10ドル, etc.

Note that using a fresh string as buffer is approx. as effective.

Stack Exchange Network

Replace $number with $number+1 in std::string

3 Answers 3

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

Replace $number with $number+1 in std::string

3 Answers 3

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related

Hot Network Questions