
I was reading DeepMind's paper on I2As and noticed that the sizes of the hidden layers in their model were all powers of 2, such as 32, 64, and 256. I have found the same thing in other papers.

Is there any performance reason for it? Maybe related to data structure alignment?

More concretely, I would like to know whether I should use these "special" sizes when training my own models.

asked Sep 8, 2017 at 15:20

1 Answer


While you can only be 100% certain by asking the authors, most of them use these values simply because you have to choose some value, and the specific value doesn't matter too much; only the order of magnitude does. Taking a power of 2 is just a natural choice.

You can also take a setup that uses a power of two and reduce that number by one. The computation time should be roughly the same, probably a bit lower. If it is noticeably higher, there might be a real performance benefit to the authors' choice. A sketch of that experiment is given below.
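A minimal sketch of that experiment, assuming PyTorch (the paper does not prescribe a framework); the helper name `time_hidden_size`, the input dimension, batch size, and iteration count are arbitrary illustrative choices, not values from the paper:

```python
import time
import torch

def time_hidden_size(hidden, in_dim=512, batch=1024, iters=200):
    """Time `iters` forward passes through a single Linear layer of width `hidden`."""
    layer = torch.nn.Linear(in_dim, hidden)
    x = torch.randn(batch, in_dim)
    with torch.no_grad():
        for _ in range(10):          # warm-up so one-time setup cost doesn't skew timing
            layer(x)
        start = time.perf_counter()
        for _ in range(iters):
            layer(x)
        return time.perf_counter() - start

for hidden in (256, 255):            # a power of two vs. one element fewer
    print(f"hidden={hidden}: {time_hidden_size(hidden):.3f} s")
```

If the power-of-two size is consistently and noticeably faster on your hardware, that is evidence for a real alignment or tiling benefit; otherwise the choice is essentially cosmetic.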

See also

answered Sep 10, 2017 at 16:28
  • I also thought the specific value wasn't important, but this predilection for powers of 2 seemed strange =). +1 for the experiment suggestion and the links. Commented Sep 13, 2017 at 11:27
  • Please also note that some people say they do it for cache alignment. Then you should ask them whether they have actually confirmed experimentally that it makes a difference. Sometimes people have heard something and either don't know enough to do it right, or just assume that it worked as expected because nothing went horribly wrong. Commented Sep 13, 2017 at 11:55
