Parsing a string pattern - Python

Asked 10 years, 7 months ago

Viewed 5k times

I have a string (for an XML test reporter) that follows this pattern:

'testsets.testcases.[testset].[testcase]-[date-stamp]'

For example:

a='testsets.testcases.test_different_blob_sizes.TestDifferentBlobSizes-20150430130436'

I know I can always parse the testset and testcase names by doing:

temp = a.split("-")[0]
current = temp.split(".")
testset = '.'.join(current[:-1]) + ".py"
testcase = current[-1]

However, I want to do this in a more Pythonic way, with a regex or some other expression that fits on a single line. How can I accomplish that?


edited May 19, 2015 at 16:28

asked May 19, 2015 at 16:20


cybertextron


possible duplicate of Python Regular Expression example
– Joel Hinz, May 19, 2015 at 16:26

What are s and its name that you suddenly begin to use?
– Malik Brahimi, May 19, 2015 at 16:27

@MalikBrahimi sorry, will update the question
– cybertextron, May 19, 2015 at 16:28

@JoelHinz I don't think they are possible duplicates ... I'm looking for a more general pattern than the one asked in that question
– cybertextron, May 19, 2015 at 16:31

3 Answers

You can try:

import re
testset, testcase = re.search(r'(.*)\.(.*)-.*', a).group(1, 2)
testset += '.py'

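re.search returns a match object when the pattern matches (and None when it does not). The match object has a group method that extracts the capture groups, i.e. the parts of the regex wrapped in "()". Because the first (.*) is greedy, it consumes everything up to the last "." before the "-", so the testset keeps its full dotted path.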
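Since re.search can return None, calling .group directly raises AttributeError on a string that does not fit the pattern. A minimal sketch of a guarded version follows; the helper name split_report_name is hypothetical and only for illustration:

```python
import re

a = 'testsets.testcases.test_different_blob_sizes.TestDifferentBlobSizes-20150430130436'


def split_report_name(name):
    """Hypothetical helper: return (testset_file, testcase), or None if no match."""
    match = re.search(r'(.*)\.(.*)-.*', name)
    if match is None:
        # The string has no "<something>.<something>-" shape at all.
        return None
    testset, testcase = match.group(1, 2)
    return testset + '.py', testcase


print(split_report_name(a))
print(split_report_name('no_dash_here'))
```

For the sample string this gives the same testset and testcase values as the split-based code in the question.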


edited May 19, 2015 at 16:34

answered May 19, 2015 at 16:33


zw324


This is an incorrect regex. Look at the OP, where brackets indicate the desired groups in the string.
– Malik Brahimi, May 19, 2015 at 16:42

Just use the groups captured by the regular expression search (a is the string from the question):

data = re.search(r'.+\..+\.(.+)\.(.+)-(\d+)', a).groups()
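If positional groups get hard to read, the same pattern can be written with named groups. This is only a sketch: the group names are my own, and treating the date stamp as YYYYmmddHHMMSS is an assumption based on the single example in the question.

```python
import re
from datetime import datetime

a = 'testsets.testcases.test_different_blob_sizes.TestDifferentBlobSizes-20150430130436'

# Same shape as the pattern above, with names attached to the three groups.
PATTERN = re.compile(
    r'.+\..+\.(?P<testset>.+)\.(?P<testcase>.+)-(?P<stamp>\d+)'
)

match = PATTERN.fullmatch(a)
fields = match.groupdict()

# Assumption: the stamp is year, month, day, hour, minute, second.
when = datetime.strptime(fields['stamp'], '%Y%m%d%H%M%S')

print(fields['testset'], fields['testcase'], when)
```

Note that this pattern captures only the last path component as the testset (test_different_blob_sizes), not the full dotted path that the code in the question builds.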


answered May 19, 2015 at 16:35


Malik Brahimi


If you strictly want to pull out the testset and testcase, i.e. "test_different_blob_sizes" and "TestDifferentBlobSizes", as in the first part of your question, you can just do:

testset, testcase = re.split(r'[.-]', a)[2:4]

For compact regexp-based code based on what you have, see Ziyao Wei's response.
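To make the difference concrete, here is a small sketch that runs the re.split one-liner on the sample string and, next to it, a regex-free variant using str.rpartition that reproduces the full dotted path from the question. The variable names module, case and module_file are mine, not from the question.

```python
import re

a = 'testsets.testcases.test_different_blob_sizes.TestDifferentBlobSizes-20150430130436'

# Split on either separator; items 2 and 3 are the short testset and testcase names.
parts = re.split(r'[.-]', a)
testset, testcase = parts[2:4]

# Regex-free alternative: peel off the stamp, then the last dotted component.
path, _, stamp = a.rpartition('-')
module, _, case = path.rpartition('.')
module_file = module + '.py'

print(testset, testcase)
print(module_file, case, stamp)
```

The fixed [2:4] slice relies on there being exactly two leading components (testsets.testcases); the rpartition variant only relies on the last "-" and the last "." before it.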


answered May 19, 2015 at 16:46


user2742051
