Search Substring in String collection

Asked 11 years, 9 months ago

Viewed 435 times

I have a big collection of strings a[1], ..., a[N] where N is about several millions. I am provided a string m and I need to iterate over all the strings a[i] that contain m. In other words, I need to find all the strings a[i] having m as substring.

What would be the most efficient data structure for that problem? I need to be very careful with memory as I am working with a big number of strings.

Improve this question

edited Dec 1, 2013 at 0:18

Michaël Le Barbier's user avatar

Michaël Le Barbier

2,07514 silver badges25 bronze badges

asked Nov 30, 2013 at 22:33

Ilya Gazman's user avatar

Ilya Gazman Ilya Gazman

2955 silver badges15 bronze badges

Is this related to specific programming language or is it acceptable to use databases?

Marek Sebera
– Marek Sebera

2013年11月30日 23:01:11 +00:00
Commented Nov 30, 2013 at 23:01
you want a string searching algo? check boyer-moore and variants

ratchet freak
– ratchet freak

2013年11月30日 23:11:37 +00:00
Commented Nov 30, 2013 at 23:11
@MarekSebera database is acceptable

Ilya Gazman
– Ilya Gazman

2013年12月01日 07:45:53 +00:00
Commented Dec 1, 2013 at 7:45

Add a comment |

1 Answer 1

Sorted by: Reset to default

if you skip n letters then you just need to check if the letter is in the first n of the input string and then check back again to see if it matches

this means creating a data structure (just a 256 long array with information about where the string might start) for the input string but allows the other collection of strings to remain unordered

also check out this blog post, and the boyer moore algorithm

Improve this answer

answered Nov 30, 2013 at 23:29

ratchet freak's user avatar

ratchet freak ratchet freak

26k2 gold badges65 silver badges101 bronze badges

Add a comment |

Your Answer

Draft saved

Draft discarded

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking "Post Your Answer", you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Stack Exchange Network

Search Substring in String collection

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

Search Substring in String collection

1 Answer 1

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related

Hot Network Questions