Check for occurences of string in other string

Question 1

Challenge

I'm sure you read the title, and came in to farm your rep, thinking its kids' stuff, but think again! You have to challenge each other in the shortest code to count the occurences of a string in another. For example, given the following input:

aaaabbbbsssffhd

as a string, and the string

s

should output

3

Rules

Just before you smile and say, "Hey, I'll use ----," read this:

No use of external libraries, or your language's API. You have to implement it manually. Which means you can't use your language's built-in function or method for counting occurences
No file I/O
No connecting with a server, website, et cetera
In a case of `ababa`, where it starts with `aba` and if you read the last 3 letters it's also `aba`, you only count one*

Thank you @ProgramFOX for that (the last rule)!
*Hint: When you count occurences, you can remove the ones you counted to avoid disobeying this rule

I think the last 2 rules are just for rule benders!

Winning Criterion

As previously stated the winner is the code with the less bytes used. The winner will be announced five days later (15 June 2014)

My Small Answer

Here's my C++ answer, where it assumes that the li variable holds the string to check occurences in, and l is the string to look for in f:

Ungolfed

int c = 0;
while (li.find(lf) != string::npos)
{
 int p = li.find(lf);
 int l = p + lf.length() - 1;
 for (p = p; p <= l; p++)
 {
 li[p] = static_cast<char>(8);
 }
 ++c;
}

Of course, to use std::string, you have to include the string header file!

Golfed

int c=0;while(li.find(lf)!=string::npos){int p=li.find(lf);int l=p+lf.length()-1;for(p=p;p<=l;p++){li[p]=static_cast<char>(8);}++c;}

Result

The variable c is going to be the value of how many times the string was found

Enjoy!

Winner

After a long wait @Dennis answer wins with only 3 bytes, written in GolfScript

Question 2

Should we count the same character twice? For example, if the input is ababa and aba, should we output 1 or 2? The third a is both the end of the first aba and the begin of the second aba.

Question 3

Does language's API mean libraries that can be imported (that come with the language) or methods of builtin datatypes or commands themselves?

Question 4

Does it need to deal with strings longer than s for matching? If not: I have a 5 char J solution: +/a=b

Question 5

Does input come through variables or STDIN?

Question 6

Function or program?

Question 7

GolfScript, 3 bytes

/,(

Assumes string and substring are on the stack.

Try it online.

How it works

/ # Split the string around occurrences of the substring.
, # Get the length of the split array.
( # Subtract 1.

Question 8

Impressive.. But I have a small question, did you learn GolfScript just for code golf and solving code problems?

Question 9

I did, but I use it occasionally for other tasks by now. Need an ASCII table? golfscript <<< '127,32,-""+' is faster than opening a web browser.

Question 10

@Dennis [win key] jqt [return] a. [return] is even quicker ;)

Question 11

@Synthetica: Well, it was just an example. But jqt sounds interesting. What is it?

Question 12

@Dennis It's the standard J console that comes with J (it's one of those graphical consoles, like the iPython QTConsole)

Question 13

JavaScript 32

Nothing really interesting here...

(p=prompt)().split(p()).length-1

split main purpose is to create an array from a string using the delimiter in argument.

Question 14

split() isn't part of "your language's API"?

Question 15

it is but its purpose is not to count occurrences of a string... so where is the limit of authorized APIs ?

Question 16

@Michael I apologize of being vague, this is acceptable.. I edited the question to rephrase to "you can't use your language's built-in function or method for counting occurences", so if it is not intended for the purpose of the question, you can use it...

Question 17

@404NotFound Good to know ... this is my answer too then? :) ... (+1)

Question 18

This is the only answer which doesn't assume that the vars have already been initialized, so you've got my upvote. If you assumed the vars were initialized like all the other answers, you would've gotten a.split(b).length-1 19. Maybe @svidgen should clarify initialization in the question.

Question 19

J (7)

No use of external libraries Check! , or your language's API. Check...? I don't know what an language API is. You have to implement it manually Check! No file I/O Check! No connecting with a server, website, et cetera Check!

+/a E.b

How it works:

E. is WindowedMatch: the J Refsheet gives 're' E. 'reread' as example. This gives 1 0 1 0 0 0. Then, the only thing left to do is simply adding this with +/ (basically sum).

I don't think this counts as using your language's built-in function or method for counting occurences, but that's disputable.

EDIT: Just to be clear:

 +/'aba'E.'ababa'
2

Question 20

I thought ababa -> aba was to return 1?

Question 21

@KyleKanos It was, but the author specifically said @Synthetica Absolutley go for it! when I asked him, so I suppose it's okay.

Question 22

C# - 73

//a = "aba";
//b = "ababa";
Console.Write(b.Split(new string[]{a},StringSplitOptions.None).Length-1);
// output = "1"

Question 23

It's wrong! aba appears twice in ababa: aba ba ab aba.

Question 24

@Runemoro "In a case of ababa, where it starts with aba and if you read the last 3 letters it's also aba, you only count one*" << In the rules..

Question 25

Oh, didn't read...

Question 26

Python 2.x - (削除) 49 23 (削除ここまで) 22 bytes

This is assuming variable input is okay. Both strings can be of any length.

Shortened @avall.

a='s'
b='aaaabbbbsssffhd'
print~-len(b.split(a))

49 bytes version, counts every instance of the substring ('aba' is in 'ababa' twice).

a='s'
b='aaaabbbbsssffhd'
print sum(a==b[i:i+len(a)]for i in range(len(b)))

Question 27

Worked with every test I put it in...

Question 28

And language's API means you have to use your own function or method, not use the language's built-in way to do it (if it has one, I'm a C++ programmer;( )

Question 29

Well the second one absolutely is language's own method for it. Also, the first one outputs 3 if a='aa' and b is the same. Is that okay?

Question 30

Yes, absolutely

Question 31

Also, it outputs 2 for the a='aba'; b='ababa' case.

Question 32

Powershell 32

($args[0]-split$args[1]).count-1

Works like this:

PS C:\MyFolder> .\ocurrences.ps1 ababa aba
1

Explanation: Uses -split to separate the first argument by the second one, returns the size of the array resulting from the split (minus 1)

Question 33

Applescript, 106 bytes

Applescript is a fun, but silly language to golf in.

on run a
set AppleScript's text item delimiters to (a's item 1)
(count of (a's item 2)'s text items)-1
end

Run with osascript:

$ osascript instr.scpt s aaaabbbbsssffhd
3
$

Question 34

C# - 66 Bytes

//s = "aba"
//t = "ababa"
Console.Write(t.Split(new[]{s},StringSplitOptions.None).Length-1);
//Output: 1

Question 35

lol. That's basically my answer. Usually on this site, if you cant come up with something different than an existing answer, you can give hints to the existing answer on how to save a few extra bytes.

Question 36

@malik Listen, I wasn't able to comment at that time. Otherwise I'd commented on your answer. I'm willing to take my answer down, if you want to.

Question 37

Fair enough. I don't mind either way. I was just letting you know.

Question 38

C (削除) 130 (削除ここまで) 120

Note: will probably crash if called with incorrect arguments.

r;main(int c,char**a){char*p=*++a,*q,*t;while(*p){for(q=a[1],t=p;*q&&*q==*t;q++)t++;*q?p++:(p=t,r++);}printf("%d\n",r);}

Ungolfed (kinda):

int main(int argc, char *argv[]) {
 int result = 0;
 char *ptr = argv[1];
 while (*ptr) {
 char *tmp, *tmp2 = ptr;
 // str(n)cmp
 for (tmp = argv[2]; *tmp; tmp++, tmp2++)
 if (*tmp != *tmp2)
 break;
 if (*tmp) {
 ptr++;
 } else {
 result++;
 ptr += tmp;
 }
 }
 printf("%d\n", result);
}

Old version with strstr and strlen: 103

l;main(int c,char**a){char*p=a[1];l=strlen(a[2]);while(c++,p>l)p=strstr(p,a[2])+l;printf("%d\n",c-5);}

Question 39

Delphi XE3 (113)

Takes 2 strings, removes substring from string and substracts new length from old length followed by a division of the substring length.

function c(a,b:string):integer;begin c:=(Length(a)-Length(StringReplace(a,b,'',[rfReplaceAll])))div Length(b)end;

Testing:

c('aaaabbbbsssffhd','s') = 3
c('aaaabbbbsssffhd','a') = 4
c('ababa','aba') = 1
c('ababa','c') = 0

Question 40

Lua (48)

So i thought I could just submit another answer, this time in lua. Its very possible this could be improved a lot, im very new to this.

print((a.len(a)-a.len(a.gsub(a,b,"")))/b.len(b))

Question 41

Fortran 90: 101

Standard abuse of implicit typing, works for any length arrays a and b, though one should expect that len(a) < len(b).

function i();i=0;k=len(trim(b))-1;do j=1,len(trim(a))-k;if(a(j:j+k)==b(1:1+k))i=i+1;enddo;endfunction

This function must be contained within a full program to work. a and b are received from stdin and can be entered either on the same line (either comma or space separated) or on different lines. Compile via gfortran -o main main.f90 and execute as you would any other compiled program.

program main
 character(len=256)::a,b
 read*,a,b
 print*,i()
 contains
 function i()
 i=0
 k=len(trim(b))-1
 do j=1,len(trim(a))-k
 if(a(j:j+k)==b(1:1+k))i=i+1
 end do
 end function
end program main

Tests:

>ababa aba
2

I could make the above return 1 if I add 4 characters (,k+1) for the do loop

> aaaabbbbbsssffhd s
3

Question 42

Mathematica (削除) 26 (削除ここまで) 23

Works like Dennis' algorithm, but wordier:

Length@StringCases[a,b]

Three chars shaved off by Szabolics.

Question 43

Length@StringCases[a,b] is shorter. This problem is not well specified because if we're not allowed to use StringCount, it's not clear whether any of the other builtin String* functions should be allowed either ... they're all based on the same underlying code.

Question 44

Thanks. I forgot about StringCases. It was unclear to me what the OP meant by the language's API.

Question 45

Using the new composition operator in Mathematica 10, we could even just write a function as Length@*StringCases. This is shorter than Length@StringCases[##]& if we were to go for just writing a function but not actually apply it to a and b.

Question 46

C++ 225

int n,k,m;
int main()
{
string s1,s2;
cin>>s1;
cin>>s2;
int x=s1.size(),y=s2.size();
if(x>=y)
{
for(int i=0;i<x;i++)
{
k=0,m=0;
for(int j=0;j<y;j++)
{
if(s2[j]==s1[i+m])
{
 k++,m++;
}
else break;
}
if(k==y)
{
n++;
i+=(y-1);
}
}
}
cout<<n<<endl;
return 0;
}

Question 47

Java (38)

System.out.print(a.split(b).length-1);

(The question did not require a complete program or function.)

Question 48

Cobra - 25

print a.split(b).length-1

Question 49

K/Kona 6

+/y~'x

where x is the string and y the substring. ~ is the negate operator, with ', it is applied to every element in x; it will return 0 if it does not match and 1 if it does match. Since it is applied element-wise, the result of y~'x is a vector, the +/ then sums the result giving the total number of occurrences.

Unfortunately, this method requires that y is only one character, otherwise we will be comparing a multi-character string to a single character string, resulting in a length error.

Question 50

Pascal, 210 B

In Standard Pascal a string has necessarily a minimum length of two char values.^† Therefore it is not possible to find the count of occurrences of a "string" of length one in another string.

type c=char;z=integer;function f(s:array[m..n:z]of c;t:array[p..q:z]of c):z;var
i,j,a,r:z;begin r:=0;i:=n-q+p;for i:=m to i do begin a:=1;for j:=0 to q-p do
begin a:=a*ord(s[i+j]=t[p+j])end;r:=r+a end;f:=r end;

Antighoulfed:

 function occurrencesCount(
 { In Pascal a string is a `packed array[1..N] of char`, N ≥ 2. }
 sample: packed array[sampleFirst..sampleLast: integer] of char;
 needle: packed array[needleFirst..needleLast: integer] of char
 ): integer;
 var
 sampleIndex, needleOffset, intermediateResult, result: integer;
 begin
 result ≔ 0;
 
 sampleIndex ≔ sampleLast − needleLast + needleFirst;
 for sampleIndex ≔ sampleFirst to sampleIndex do
 begin
 intermediateResult ≔ 1;
 
 for needleOffset ≔ 0 to needleLast − needleFirst do
 begin
 { The product is zero if any factor is zero. }
 intermediateResult ≔ intermediateResult *
 ord(
 sample[sampleIndex + needleOffset]
 =
 needle[needleFirst + needleOffset]
 )
 end;
 
 result ≔ result + intermediateResult
 end;
 { In Pascal the function’s result is defined by assigning a value
 to the implicitly declared variable bearing the function’s name. }
 occurrencesCount ≔ result
 end;

_{^† Extended Pascal allows an empty string (''), too.}

Dennis Dennis 212k41 gold badges379 silver badges829 bronze badges · Accepted Answer · 2014-06-10 17:04:11Z

7

\$\begingroup\$

GolfScript, 3 bytes

/,(

Assumes string and substring are on the stack.

Try it online.

How it works

/ # Split the string around occurrences of the substring.
, # Get the length of the split array.
( # Subtract 1.

Share

Improve this answer

answered Jun 10, 2014 at 17:04

Dennis's user avatar

Dennis Dennis

212k41 gold badges379 silver badges829 bronze badges

\$\endgroup\$

8

\$\begingroup\$ Impressive.. But I have a small question, did you learn GolfScript just for code golf and solving code problems? \$\endgroup\$

user19785
– user19785

2014年06月10日 17:05:43 +00:00
Commented Jun 10, 2014 at 17:05
1

\$\begingroup\$ I did, but I use it occasionally for other tasks by now. Need an ASCII table? golfscript <<< '127,32,-""+' is faster than opening a web browser. \$\endgroup\$

Dennis
– Dennis

2014年06月10日 17:09:10 +00:00
Commented Jun 10, 2014 at 17:09
\$\begingroup\$ @Dennis [win key] jqt [return] a. [return] is even quicker ;) \$\endgroup\$

ɐɔıʇǝɥʇuʎs
– ɐɔıʇǝɥʇuʎs

2014年06月10日 17:12:29 +00:00
Commented Jun 10, 2014 at 17:12
\$\begingroup\$ @Synthetica: Well, it was just an example. But jqt sounds interesting. What is it? \$\endgroup\$

Dennis
– Dennis

2014年06月10日 17:17:54 +00:00
Commented Jun 10, 2014 at 17:17
\$\begingroup\$ @Dennis It's the standard J console that comes with J (it's one of those graphical consoles, like the iPython QTConsole) \$\endgroup\$

ɐɔıʇǝɥʇuʎs
– ɐɔıʇǝɥʇuʎs

2014年06月10日 17:24:05 +00:00
Commented Jun 10, 2014 at 17:24

| Show 3 more comments

Stack Exchange Network

Check for occurences of string in other string

Challenge

Rules

Winning Criterion

My Small Answer

Ungolfed

Golfed

Result

Enjoy!

Winner

18 Answers 18

GolfScript, 3 bytes

How it works

JavaScript 32

J (7)

C# - 73

Python 2.x - (削除) 49 23 (削除ここまで) 22 bytes

Powershell 32

Applescript, 106 bytes

C# - 66 Bytes

C (削除) 130 (削除ここまで) 120

Delphi XE3 (113)

Lua (48)

Fortran 90: 101

Mathematica (削除) 26 (削除ここまで) 23

C++ 225

Java (38)

Cobra - 25

K/Kona 6

Pascal, 210 B

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

Check for occurences of string in other string

Challenge

Rules

Winning Criterion

My Small Answer

Ungolfed

Golfed

Result

Enjoy!

Winner

18 Answers 18

GolfScript, 3 bytes

How it works

JavaScript 32

J (7)

C# - 73

Python 2.x - (削除) 49 23 (削除ここまで) 22 bytes

Powershell 32

Applescript, 106 bytes

C# - 66 Bytes

C (削除) 130 (削除ここまで) 120

Delphi XE3 (113)

Lua (48)

Fortran 90: 101

Mathematica (削除) 26 (削除ここまで) 23

C++ 225

Java (38)

Cobra - 25

K/Kona 6

Pascal, 210 B

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related

Hot Network Questions