Identify duplicate characters in a String and replace them

Question 1

Found a JavaScript assignment on Codewars.com:

The goal of this exercise is to convert a string to a new string where each character in the new string is '(' if that character appears only once in the original string, or ')' if that character appears more than once in the original string. Ignore capitalization when determining if a character is a duplicate.

Examples:
"din" => "((("
"recede" => "()()()"
"Success" => ")())())"
"(( @" => "))(("

I did the assignment in Java, and came up with below code. I want to know is this the write way to code or is there a better way? If so please let me know.

private static String duplicateEncode(String word) {
 String encodeString = "";
 String decodedString = word.toLowerCase();
 for (int i = 0; i < decodedString.length(); i++) {
 if (decodedString.substring(i + 1, decodedString.length())
 .contains(String.valueOf(decodedString.charAt(i)))
 || decodedString.substring(0, i).contains(String.valueOf(decodedString.charAt(i)))) {
 encodeString += ")";
 } else {
 encodeString += "(";
 }
 }
 return encodeString;
}

Question 2

First of all, nullbyte is exactly right about the use of a StringBuilder: using a += b for Strings roughly equals new StringBuilder().append(a).append(b).toString() under the hood and thus is a quite expensive operation.

Then, your method to check whether a given charcter exists in the string after a given index is s.substring(i).contains(ch) which again creates a superfluos string object via the substring-call. Instead, simply use s.indexOf(ch, i) to find the next index of ch after the given starting index (returns -1 for "not found").

Additionally, you might consider nullbyte's suggestion to hold the seen-or-not-seen status in a set, which also was my first intuition when reading the exercise. (Right now, you have runtime complexity of O(n^2) which you could reduce to O(n)). You only have to somehow come up with a method to revisit the first character, maybe by not using a set but by usign a map from character to number of occurrences or something like that.

Naming: why decodedString? This is not decoded in any way, this is just lowercase word. I can live with encodeString (or probably better encoded), but decodedString does not look right and does not give the reader any clue about the variable's contents.

Question 3

public static String encode(String input) {
 input= input.toLowerCase(); //ignoring case
 StringBuilder myNewString = new StringBuilder(); // for holding new String
 for(char ch : input.toCharArray()) {
 if(StringUtils.countMatches(input, ch) > 1){ // more than 1 condition
 myNewString.append(")");
 }else{ // not repeating chars
 myNewString.append("(");
 }
 }
 myNewString.toString();
}

I am using StringUtils class from Apache Commons project. So need add this to your import: import org.apache.commons.lang3.StringUtils;

Question 4

Same as I already commented at nullbyte's answer. This is not a review. BTW, what is StringUtils?

Question 5

oh forgot to mention, StringUtils is an Apache Commons project. Edited my original post and added package. Thanks

Question 6

This seems to be a little bit more concise solution than the original one. What's the time complexity of the StringUtils.countMatches() method? It's definitely O(n) which doesn't make it any better. You used a StringBuilder though.

Question 7

Here's a better solution using a hash map:

 public static String encode(String input) {
 final StringBuilder builder = new StringBuilder();
 final Map<Character, Integer> indexMap = new HashMap<>();
 for (int index = 0; index < input.length(); index++) {
 char ch = Character.toLowerCase(input.charAt(index));
 if(indexMap.containsKey(ch)) {
 int previous = indexMap.get(ch);
 builder.replace(previous, previous+1, ")");
 builder.append(')');
 }else {
 indexMap.put(ch, index);
 builder.append('(');
 }
 }
 return builder.toString();
}

It requires O(n) time and O(n) additional space, but it's way faster than yours. Also, try using a StringBuilder if you append strings in a loop.

Question 8

The above code will fail if the first character is coming twice. For eg: if my string is abca, i am expecting an output )((). But the above code will print ((().

Question 9

Not worth my karma to downvote this, but I don't consider "I can do it better, here's some code" a review.

Question 10

@mtj Well, I think it's not worth pointing out, but the OP's question is: I want to know if is this the right way to code or if there is a better way? Is this how you read it? First of all, it was not my intention to hurt someone else's feelings by "showing off" with a better solution. Secondly, there's really nothing to review, and I think I've pointed (as well as you have) the main points in my post. I could have done it more explicitly though.

Question 11

@SKV Ha! that's a good catch. Thanks. Please see my updated answer

mtj mtj 4,94211 silver badges21 bronze badges · Answer 1 · 2018-03-13 07:25:59Z

First of all, nullbyte is exactly right about the use of a StringBuilder: using a += b for Strings roughly equals new StringBuilder().append(a).append(b).toString() under the hood and thus is a quite expensive operation.

Then, your method to check whether a given charcter exists in the string after a given index is s.substring(i).contains(ch) which again creates a superfluos string object via the substring-call. Instead, simply use s.indexOf(ch, i) to find the next index of ch after the given starting index (returns -1 for "not found").

Additionally, you might consider nullbyte's suggestion to hold the seen-or-not-seen status in a set, which also was my first intuition when reading the exercise. (Right now, you have runtime complexity of O(n^2) which you could reduce to O(n)). You only have to somehow come up with a method to revisit the first character, maybe by not using a set but by usign a map from character to number of occurrences or something like that.

Naming: why decodedString? This is not decoded in any way, this is just lowercase word. I can live with encodeString (or probably better encoded), but decodedString does not look right and does not give the reader any clue about the variable's contents.

Sunilkumar V Sunilkumar V 992 bronze badges · Answer 2 · 2018-03-13 07:02:07Z

-1

\$\begingroup\$

public static String encode(String input) {
 input= input.toLowerCase(); //ignoring case
 StringBuilder myNewString = new StringBuilder(); // for holding new String
 for(char ch : input.toCharArray()) {
 if(StringUtils.countMatches(input, ch) > 1){ // more than 1 condition
 myNewString.append(")");
 }else{ // not repeating chars
 myNewString.append("(");
 }
 }
 myNewString.toString();
}

I am using StringUtils class from Apache Commons project. So need add this to your import: import org.apache.commons.lang3.StringUtils;

Share

edited Mar 13, 2018 at 7:59

answered Mar 13, 2018 at 7:02

Sunilkumar V's user avatar

Sunilkumar V Sunilkumar V

992 bronze badges

\$\endgroup\$

3

1

\$\begingroup\$ Same as I already commented at nullbyte's answer. This is not a review. BTW, what is StringUtils? \$\endgroup\$

mtj
– mtj

2018年03月13日 07:13:33 +00:00
Commented Mar 13, 2018 at 7:13
\$\begingroup\$ oh forgot to mention, StringUtils is an Apache Commons project. Edited my original post and added package. Thanks \$\endgroup\$

Sunilkumar V
– Sunilkumar V

2018年03月13日 07:55:36 +00:00
Commented Mar 13, 2018 at 7:55
\$\begingroup\$ This seems to be a little bit more concise solution than the original one. What's the time complexity of the StringUtils.countMatches() method? It's definitely O(n) which doesn't make it any better. You used a StringBuilder though. \$\endgroup\$

nullbyte
– nullbyte

2018年03月14日 00:53:31 +00:00
Commented Mar 14, 2018 at 0:53

Add a comment |

nullbyte nullbyte 4153 silver badges11 bronze badges · Answer 3 · 2018-03-13 04:53:00Z

-1

\$\begingroup\$

Here's a better solution using a hash map:

 public static String encode(String input) {
 final StringBuilder builder = new StringBuilder();
 final Map<Character, Integer> indexMap = new HashMap<>();
 for (int index = 0; index < input.length(); index++) {
 char ch = Character.toLowerCase(input.charAt(index));
 if(indexMap.containsKey(ch)) {
 int previous = indexMap.get(ch);
 builder.replace(previous, previous+1, ")");
 builder.append(')');
 }else {
 indexMap.put(ch, index);
 builder.append('(');
 }
 }
 return builder.toString();
}

It requires O(n) time and O(n) additional space, but it's way faster than yours. Also, try using a StringBuilder if you append strings in a loop.

Share

edited Mar 14, 2018 at 0:24

answered Mar 13, 2018 at 4:53

nullbyte's user avatar

nullbyte nullbyte

4153 silver badges11 bronze badges

\$\endgroup\$

4

\$\begingroup\$ The above code will fail if the first character is coming twice. For eg: if my string is abca, i am expecting an output )((). But the above code will print (((). \$\endgroup\$

Sunilkumar V
– Sunilkumar V

2018年03月13日 06:53:49 +00:00
Commented Mar 13, 2018 at 6:53
\$\begingroup\$ Not worth my karma to downvote this, but I don't consider "I can do it better, here's some code" a review. \$\endgroup\$

mtj
– mtj

2018年03月13日 07:12:57 +00:00
Commented Mar 13, 2018 at 7:12
\$\begingroup\$ @mtj Well, I think it's not worth pointing out, but the OP's question is: I want to know if is this the right way to code or if there is a better way? Is this how you read it? First of all, it was not my intention to hurt someone else's feelings by "showing off" with a better solution. Secondly, there's really nothing to review, and I think I've pointed (as well as you have) the main points in my post. I could have done it more explicitly though. \$\endgroup\$

nullbyte
– nullbyte

2018年03月14日 00:45:53 +00:00
Commented Mar 14, 2018 at 0:45
\$\begingroup\$ @SKV Ha! that's a good catch. Thanks. Please see my updated answer \$\endgroup\$

nullbyte
– nullbyte

2018年03月14日 00:46:48 +00:00
Commented Mar 14, 2018 at 0:46

Add a comment |

Stack Exchange Network

Identify duplicate characters in a String and replace them

Examples:

3 Answers 3

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

Identify duplicate characters in a String and replace them

Examples:

3 Answers 3

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related

Hot Network Questions