Writing shorter, readable, pythonic code

Question 1

I'm trying to produce shorter, more pythonic, readable python. And I have this working solution for Project Euler's problem 8 (find the greatest product of 5 sequential digits in a 1000 digit number).

Suggestions for writing a more pythonic version of this script?

numstring = ''
for line in open('8.txt'):
 numstring += line.rstrip()
nums = [int(x) for x in numstring]
best=0
for i in range(len(nums)-4):
 subset = nums[i:i+5]
 product=1
 for x in subset:
 product *= x
 if product>best:
 best=product
 bestsubset=subset
print best
print bestsubset

For example: there's gotta be a one-liner for the below snippet. I'm sure there's a past topic on here but I'm not sure how to describe what I'm doing below.

numstring = ''
for line in open('8.txt'):
 numstring += line.rstrip()

Any suggestions? thanks guys!

Question 2

I'm working on a full answer, but for now here's the one liner

numstring = ''.join(x.rstrip() for x in open('8.txt'))

Edit: Here you go! One liner for the search. List comprehensions are wonderful.

from operator import mul
def prod(list):
 return reduce(mul, list)
numstring = ''.join(x.rstrip() for x in open('8.txt'))
nums = [int(x) for x in numstring]
print max(prod(nums[i:i+5]) for i in range(len(nums)-4))

Question 3

That's really slick. what do you think of using a lambda instead of mul? i.e. def prod(list): return reduce(lambda x,y: x*y, list)

Question 4

That works well too. I don't know why python didn't build it in - it's a pretty common requirement (even more so for Project Euler!), and it really helped having it as a built-in in R.

Question 5

Where Python offers a built-in like operator.mul, it is generally more efficient to use that rather than a lambda. For something like this, the efficiency doesn't really matter; your computer will find the answer in the blink of an eye, so you can use what you prefer. But in general, it's not bad to be in the habit of importing from operator when you are doing things with reduce() or map() or whatever.

Question 6

from operator import mul
def product(nums):
 return reduce(mul, nums)
nums = [int(c) for c in open('8.txt').read() if c.isdigit()]
result = max((product(nums[i:i+5]) for i in range(len(nums))))

Question 7

@thebjorn: I intentionally didn't subtract 4 because it didn't impact the result. If I was going to subtract I would have probably done something like range(len(nums) - 5 + 1) and maybe even named the magic number at that point.

Question 8

There are some rather elegant tricks being used here. But your use of max() with a key means that your result is set to the sequence of 5 numbers, not to their product. It would be better to simply use result = max(product(nums[i:i+5]) for i in range(len(nums)))

Question 9

I have to say I really like the list comprehension that creates nums. Never mind using .replace() to get rid of line endings; just pull only the digits and convert them to integers in one go. Elegant.

Question 10

@steveha. I misread the problem. I thought it needed the actual sequence. I'll edit the answer.

Question 11

@steveha. Ah. I see. The OP keeps the actual sequence, but the Project Euler problem does not require it.

Question 12

Here is my solution. I tried to write the most "Pythonic" code that I know how to write.

with open('8.txt') as f:
 numstring = f.read().replace('\n', '')
nums = [int(x) for x in numstring]
def sub_lists(lst, length):
 for i in range(len(lst) - (length - 1)):
 yield lst[i:i+length]
def prod(lst):
 p = 1
 for x in lst:
 p *= x
 return p
best = max(prod(lst) for lst in sub_lists(nums, 5))
print(best)

Arguably, this is one of the ideal cases to use reduce so maybe prod() should be:

# from functools import reduce # uncomment this line for Python 3.x
from operator import mul
def prod(lst):
 return reduce(mul, lst, 1)

I don't like to try to write one-liners where there is a reason to have more than one line. I really like the with statement, and it's my habit to use that for all I/O. For this small problem, you could just do the one-liner, and if you are using PyPy or something the file will get closed when your small program finishes executing and exits. But I like the two-liner using with so I wrote that.

I love the one-liner by @Steven Rumbalski:

nums = [int(c) for c in open('8.txt').read() if c.isdigit()]

Here's how I would probably write that:

with open("8.txt") as f:
 nums = [int(ch) for ch in f.read() if ch.isdigit()]

Again, for this kind of short program, your file will be closed when the program exits so you don't really need to worry about making sure the file gets closed; but I like to make a habit of using with.

Question 13

Yeah I think that the definition of sub_lists(lst, length) makes a lot of sense. It was confusing to use the magic number as in len(nums)-4.

Question 14

Using a definition of prod like that is significantly slower than using the builtin mul from operator.

Question 15

As far as explaining what that last bit was, first you create an empty string called numstring:

numstring = ''

Then you loop over every line of text (or line of strings) in the txt file 8.txt:

for line in open('8.txt'):

And so for every line you find, you want to add the result of line.rstrip() to it. rstrip 'strips' the whitespace (newlines,spaces etc) from the string:

 numstring += line.rstrip()

Say you had a file, 8.txt that contains the text: LineOne \nLyneDeux\t\nLionTree you'd get a result that looked something like this in the end:

>>>'LineOne' #loop first time
>>>'LineOneLyneDeux' # second time around the bush
>>>'LineOneLyneDeuxLionTree' #final answer, reggie

Question 16

Thanks for the thoughtful explanation @TankorSmash. I should have been clearer in my question, what I meant was: I dont know how to describe what I'm doing here succinctly enough to search for past topics on it.

Question 17

Here's a full solution! First read out the number:

with open("8.txt") as infile:
 number = infile.replace("\n", "")

Then create a list of lists with 5 consecutive numbers:

cons_numbers = [list(map(int, number[i:i+5])) for i in range(len(number) - 4)]

Then find the largest and print it:

print(max(reduce(operator.mul, nums) for nums in cons_numbers))

If you're using Python 3.x you need to replace reduce with functools.reduce.

Question 18

you can just replace '\n' with ''

Question 19

@JBernardo: sure, but that'll split on any whitespace, and "\n" makes the intent more clear.

Question 20

@JBernardo: ah now I see, yeah that's probably better.

Question 21

@nightcracker: range(len(number) - 5) is a bug. Test it on '123456789'. It misses the digit 9.

Question 22

map, reduce, and lambda aren't consider Pythonic by Guido ( artima.com/weblogs/viewpost.jsp?thread=98196 ).

Rob Volgman 2,1143 gold badges18 silver badges28 bronze badges · Answer 1 · 2012-07-27 18:04:12Z

4

I'm working on a full answer, but for now here's the one liner

numstring = ''.join(x.rstrip() for x in open('8.txt'))

Edit: Here you go! One liner for the search. List comprehensions are wonderful.

from operator import mul
def prod(list):
 return reduce(mul, list)
numstring = ''.join(x.rstrip() for x in open('8.txt'))
nums = [int(x) for x in numstring]
print max(prod(nums[i:i+5]) for i in range(len(nums)-4))

Share

Improve this answer

edited Jul 27, 2012 at 18:34

answered Jul 27, 2012 at 18:04

Rob Volgman's user avatar

Rob Volgman

2,1143 gold badges18 silver badges28 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

dyln

dyln Over a year ago

That's really slick. what do you think of using a lambda instead of mul? i.e. def prod(list): return reduce(lambda x,y: x*y, list)

2012年07月27日T19:00:47.25Z+00:00

Rob Volgman

Rob Volgman Over a year ago

That works well too. I don't know why python didn't build it in - it's a pretty common requirement (even more so for Project Euler!), and it really helped having it as a built-in in R.

2012年07月27日T19:03:24.507Z+00:00

steveha

steveha Over a year ago

Where Python offers a built-in like operator.mul, it is generally more efficient to use that rather than a lambda. For something like this, the efficiency doesn't really matter; your computer will find the answer in the blink of an eye, so you can use what you prefer. But in general, it's not bad to be in the habit of importing from operator when you are doing things with reduce() or map() or whatever.

2012年07月27日T19:09:48.973Z+00:00

Steven Rumbalski 45.8k10 gold badges96 silver badges125 bronze badges · Answer 2 · 2012-07-27 18:06:53Z

4

from operator import mul
def product(nums):
 return reduce(mul, nums)
nums = [int(c) for c in open('8.txt').read() if c.isdigit()]
result = max((product(nums[i:i+5]) for i in range(len(nums))))

Share

Improve this answer

edited Jul 27, 2012 at 20:11

answered Jul 27, 2012 at 18:06

Steven Rumbalski's user avatar

Steven Rumbalski

45.8k10 gold badges96 silver badges125 bronze badges

6 Comments

Steven Rumbalski

Steven Rumbalski Over a year ago

@thebjorn: I intentionally didn't subtract 4 because it didn't impact the result. If I was going to subtract I would have probably done something like range(len(nums) - 5 + 1) and maybe even named the magic number at that point.

2012年07月27日T18:43:26.42Z+00:00

steveha

steveha Over a year ago

There are some rather elegant tricks being used here. But your use of max() with a key means that your result is set to the sequence of 5 numbers, not to their product. It would be better to simply use result = max(product(nums[i:i+5]) for i in range(len(nums)))

2012年07月27日T19:08:07.507Z+00:00

steveha

steveha Over a year ago

I have to say I really like the list comprehension that creates nums. Never mind using .replace() to get rid of line endings; just pull only the digits and convert them to integers in one go. Elegant.

2012年07月27日T19:19:50.49Z+00:00

Steven Rumbalski

Steven Rumbalski Over a year ago

@steveha. I misread the problem. I thought it needed the actual sequence. I'll edit the answer.

2012年07月27日T20:10:41.113Z+00:00

Steven Rumbalski

Steven Rumbalski Over a year ago

@steveha. Ah. I see. The OP keeps the actual sequence, but the Project Euler problem does not require it.

2012年07月27日T20:12:32.99Z+00:00

|

steveha 77.2k21 gold badges94 silver badges119 bronze badges · Answer 3 · 2012-07-27 19:04:15Z

Here is my solution. I tried to write the most "Pythonic" code that I know how to write.

with open('8.txt') as f:
 numstring = f.read().replace('\n', '')
nums = [int(x) for x in numstring]
def sub_lists(lst, length):
 for i in range(len(lst) - (length - 1)):
 yield lst[i:i+length]
def prod(lst):
 p = 1
 for x in lst:
 p *= x
 return p
best = max(prod(lst) for lst in sub_lists(nums, 5))
print(best)

Arguably, this is one of the ideal cases to use reduce so maybe prod() should be:

# from functools import reduce # uncomment this line for Python 3.x
from operator import mul
def prod(lst):
 return reduce(mul, lst, 1)

I don't like to try to write one-liners where there is a reason to have more than one line. I really like the with statement, and it's my habit to use that for all I/O. For this small problem, you could just do the one-liner, and if you are using PyPy or something the file will get closed when your small program finishes executing and exits. But I like the two-liner using with so I wrote that.

I love the one-liner by @Steven Rumbalski:

nums = [int(c) for c in open('8.txt').read() if c.isdigit()]

Here's how I would probably write that:

with open("8.txt") as f:
 nums = [int(ch) for ch in f.read() if ch.isdigit()]

Again, for this kind of short program, your file will be closed when the program exits so you don't really need to worry about making sure the file gets closed; but I like to make a habit of using with.

Yeah I think that the definition of sub_lists(lst, length) makes a lot of sense. It was confusing to use the magic number as in len(nums)-4.
Using a definition of prod like that is significantly slower than using the builtin mul from operator.

TankorSmash 12.9k6 gold badges71 silver badges108 bronze badges · Answer 4 · 2012-07-27 18:02:30Z

As far as explaining what that last bit was, first you create an empty string called numstring:

numstring = ''

Then you loop over every line of text (or line of strings) in the txt file 8.txt:

for line in open('8.txt'):

And so for every line you find, you want to add the result of line.rstrip() to it. rstrip 'strips' the whitespace (newlines,spaces etc) from the string:

 numstring += line.rstrip()

Say you had a file, 8.txt that contains the text: LineOne \nLyneDeux\t\nLionTree you'd get a result that looked something like this in the end:

>>>'LineOne' #loop first time
>>>'LineOneLyneDeux' # second time around the bush
>>>'LineOneLyneDeuxLionTree' #final answer, reggie

Thanks for the thoughtful explanation @TankorSmash. I should have been clearer in my question, what I meant was: I dont know how to describe what I'm doing here succinctly enough to search for past topics on it.

orlp 119k39 gold badges227 silver badges325 bronze badges · Answer 5 · 2012-07-27 18:02:51Z

0

Here's a full solution! First read out the number:

with open("8.txt") as infile:
 number = infile.replace("\n", "")

Then create a list of lists with 5 consecutive numbers:

cons_numbers = [list(map(int, number[i:i+5])) for i in range(len(number) - 4)]

Then find the largest and print it:

print(max(reduce(operator.mul, nums) for nums in cons_numbers))

If you're using Python 3.x you need to replace reduce with functools.reduce.

Share

Improve this answer

edited Jul 27, 2012 at 19:27

answered Jul 27, 2012 at 18:02

orlp's user avatar

orlp

119k39 gold badges227 silver badges325 bronze badges

9 Comments

JBernardo

JBernardo Over a year ago

you can just replace '\n' with ''

2012年07月27日T18:05:03.467Z+00:00

orlp

orlp Over a year ago

@JBernardo: sure, but that'll split on any whitespace, and "\n" makes the intent more clear.

2012年07月27日T18:05:32.523Z+00:00

orlp

orlp Over a year ago

@JBernardo: ah now I see, yeah that's probably better.

2012年07月27日T18:08:15.67Z+00:00

Steven Rumbalski

Steven Rumbalski Over a year ago

@nightcracker: range(len(number) - 5) is a bug. Test it on '123456789'. It misses the digit 9.

2012年07月27日T18:23:12.35Z+00:00

thebjorn

thebjorn Over a year ago

map, reduce, and lambda aren't consider Pythonic by Guido ( artima.com/weblogs/viewpost.jsp?thread=98196 ).

2012年07月27日T18:37:23.373Z+00:00

|

CollectivesTM on Stack Overflow

Writing shorter, readable, pythonic code

5 Answers 5

3 Comments

6 Comments

2 Comments

1 Comment

9 Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

CollectivesTM on Stack Overflow

5 Answers 5

3 Comments

6 Comments

2 Comments

1 Comment

9 Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related