I'm making a neural network that comprises five populations of feature-selective neurons and one population of non-selective neurons. Each neuron in this network receives connections from
- c * f * N_E randomly selected neurons from each of the selective populations
- c * (1 - f * p) * N_E randomly selected non-selective neurons.
Below is the code I've written to define all of the connections in this network. Specifically, the code produces two arrays that together represent the source (ee_i) and target (ee_j) nodes of every connection.
Can anyone think of a way to reorganize this code to speed it up?
import numpy as np
N_E = 8000
p = 5
f = 0.1
c = 0.2
encoders = np.random.choice(np.arange(N_E), (p, int(f * N_E)), False)
non_selec = np.array([i for i in range(N_E) if i not in encoders])
ee_i = []
ee_j = np.repeat(np.arange(N_E), int(c * N_E))
for j in range(N_E):
    ee_i = np.concatenate(
        [ee_i,
         np.concatenate([np.random.choice(encoders[0, :], int(c * f * N_E), False),
                         np.random.choice(encoders[1, :], int(c * f * N_E), False),
                         np.random.choice(encoders[2, :], int(c * f * N_E), False),
                         np.random.choice(encoders[3, :], int(c * f * N_E), False),
                         np.random.choice(encoders[4, :], int(c * f * N_E), False),
                         np.random.choice(non_selec, int(c * (1 - f * p) * N_E), False)])])
2 Answers
Thanks for an interesting question. My review starts with your main interest -- speeding up what you've already got. It then goes on to discuss a number of other topics you may be interested in.
Speedup
To figure out what's slow in your code, you've got to profile it. Python makes this easy with the line_profiler package. I used this tool on your code and found that np.concatenate was responsible for 88% of the run time.
After thinking about it, this didn't surprise me. The way you build ee_i is very inefficient. Keep in mind that the output of np.concatenate() is a new array, so every loop iteration requires instantiating a new NumPy array and copying all the existing ee_i data into it.
A better way is to pre-allocate the ee_i array with something like this:
ee_j = np.repeat(np.arange(num_neurons), num_cnxns)
ee_i = np.zeros_like(ee_j, dtype='int')
### REST OF CODE ###
Then, instead of copying all of the existing ee_i data into a new array every time you go through the loop, just update the relevant positions of ee_i using NumPy slice notation:
ee_i[start_idx:stop_idx] = ## <code that supplies new values> ##
This sped up the code by more than 5-fold for me.
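Putting those pieces together, a minimal sketch of the pre-allocated version might look like this (the variable names num_cnxns, n_per_pop, and n_nonsel are my own; I've also used np.setdiff1d in place of the question's list comprehension, which produces the same sorted array of non-selective indices):

```python
import numpy as np

N_E = 8000
p = 5
f = 0.1
c = 0.2

encoders = np.random.choice(np.arange(N_E), (p, int(f * N_E)), replace=False)
non_selec = np.setdiff1d(np.arange(N_E), encoders)

num_cnxns = int(c * N_E)               # incoming connections per neuron
n_per_pop = int(c * f * N_E)           # draws from each selective population
n_nonsel = int(c * (1 - f * p) * N_E)  # draws from the non-selective pool

ee_j = np.repeat(np.arange(N_E), num_cnxns)
ee_i = np.zeros_like(ee_j)  # pre-allocated once, filled in place below

for j in range(N_E):
    start = j * num_cnxns
    for k in range(p):
        ee_i[start:start + n_per_pop] = np.random.choice(
            encoders[k, :], n_per_pop, replace=False)
        start += n_per_pop
    ee_i[start:start + n_nonsel] = np.random.choice(
        non_selec, n_nonsel, replace=False)
```

Each iteration writes its 1,600 new source indices directly into the pre-allocated slots, so no array is ever copied.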
Other issues
Why encoders = np.random.choice(np.arange(N_E), (p, int(f * N_E)), False) instead of just arbitrarily specifying that the first f * N_E neurons are the first population of encoders, the next f * N_E (indices f * N_E to 2 * f * N_E - 1) the second population, and so on? Since the numbering and ordering of neurons is (presumably) arbitrary, it shouldn't matter if you number them in a way that's convenient for you.
It's tough to understand your variable names. What is f? What is c? It would be nice to have a docstring that explained at least this much.
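For example, the contiguous-numbering idea could be set up like this (my own sketch, using the question's population size of f * N_E neurons per population):

```python
import numpy as np

N_E, p, f = 8000, 5, 0.1
pop_size = int(f * N_E)

# number the neurons so that population k occupies one contiguous block
encoders = np.arange(p * pop_size).reshape(p, pop_size)
# everything after the selective blocks is non-selective
non_selec = np.arange(p * pop_size, N_E)
```

No random draw is needed at all here, and membership tests become simple range checks.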
A different (faster) approach
Conceptually, the connections between neurons can be thought of as a square matrix of boolean values. If there is a connection from neuron i to neuron j, then this matrix has a True at position (i, j). This is a very different representation from the arrays of sources and targets you are trying to generate, and I find it much more intuitive.
The downside of the matrix representation is that many of the entries will be 0, and it requires creating a very large matrix, which is memory (and sometimes speed) inefficient. Thus, the lists of sources and targets you are trying to create are much more efficient.
However, sparse matrices offer a way to get the intuitive advantages of the matrix representation with the speed and memory properties of the edge-list representation you are shooting for. In particular, SciPy's COO sparse matrix format has an underlying representation that is nearly exactly your lists of source and target nodes (i.e. a list of the row indices and a list of the column indices with nonzero values).
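As a tiny illustration of that correspondence (my own example, not from the original code): the row and column index arrays you pass to a COO matrix are stored as-is, so they *are* your source and target lists.

```python
import numpy as np
from scipy import sparse

# three edges of a 4-node graph, as parallel source/target index lists
sources = np.array([0, 2, 1])
targets = np.array([1, 3, 3])

adj = sparse.coo_matrix((np.ones(3, dtype=bool), (sources, targets)),
                        shape=(4, 4))

# COO stores exactly the index lists we passed in
assert np.array_equal(adj.row, sources)
assert np.array_equal(adj.col, targets)
```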
Here's a version of that approach:
from scipy import sparse
import numpy as np

def rand_col_sparse_bool_arr(m, n, c):
    """
    Return a sparse boolean matrix (COO format) of size m by n with c nonzero entries per column.

    :param m: int, number of rows in output matrix
    :param n: int, number of columns in output matrix
    :param c: int, number of nonzero entries in each column of output matrix
    """
    if c > m:
        raise ValueError('c must be less than m.')
    data = np.repeat([True], repeats=c * n)
    # note: rows are drawn with replacement, so a column can receive
    # duplicate coordinates (i.e. fewer than c distinct nonzero rows)
    row_idxs = np.random.randint(m, size=c * n)
    col_idxs = np.repeat(np.arange(n), repeats=c)
    # return the sparse matrix
    return sparse.coo_matrix((data, (row_idxs, col_idxs)),
                             dtype=bool,
                             shape=(m, n))
This first function is a general helper that creates random boolean matrices with nonzero entries at random row locations in every column. It can be used to create all the "sub-matrices" for each population of selective/encoding neurons, and also for the non-selective neurons.
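To see how stacking such column blocks maps back to edge lists, here is a toy, self-contained sketch of the same idea (my own example; the key point is that sparse.hstack shifts the column indices of each successive block):

```python
import numpy as np
from scipy import sparse

# two tiny "population" blocks of 2 columns each, over 4 possible rows
left = sparse.coo_matrix((np.ones(2, dtype=bool), ([0, 3], [0, 1])),
                         shape=(4, 2))
right = sparse.coo_matrix((np.ones(2, dtype=bool), ([1, 2], [0, 1])),
                          shape=(4, 2))

both = sparse.hstack([left, right])  # COO format by default

# the right block's columns now start at index 2
edges = set(zip(both.row.tolist(), both.col.tolist()))
assert edges == {(0, 0), (3, 1), (1, 2), (2, 3)}
```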
I tried to make a function that builds the population structure you want, and I think I did so. But you should check it carefully to make sure the population structure is really what you want. I may have gotten the rows and columns (sources and sinks) backwards, for example. Even if there are errors, I think this code serves to illustrate the approach:
def generate_network(n_e=8000, p=5, f=0.1, c=0.2):
    # Generate a matrix of connections for a neural network as described at Code Review
    # http://codereview.stackexchange.com/questions/129645/assembling-edges-of-a-graph

    # unpack parameters
    num_selec_cnxns_per_pop = int(c * f * n_e)
    num_selec_cnxns = int(num_selec_cnxns_per_pop * p)
    num_nonselec_cnxns = int(c * (1 - f * p) * n_e)
    num_cnxns = num_selec_cnxns + num_nonselec_cnxns
    num_encoders_per_pop = int(f * n_e)
    num_encoders = int(num_encoders_per_pop * p)
    num_nonselec = n_e - num_encoders

    # make the encoder matrices
    encoder_connectivity_mat = sparse.hstack(
        [rand_col_sparse_bool_arr(n_e,
                                  num_encoders_per_pop,
                                  num_selec_cnxns_per_pop) for _ in range(p)])

    # make the nonselective matrix
    nonselective_connectivity_mat = rand_col_sparse_bool_arr(n_e,
                                                             num_nonselec,
                                                             num_nonselec_cnxns)

    # stack the matrices together
    connectivity_mat = sparse.hstack([encoder_connectivity_mat,
                                      nonselective_connectivity_mat])

    # check the size of the output
    assert connectivity_mat.shape == (n_e, n_e)

    # derive and return connectivity in requested format
    ee_i = connectivity_mat.col
    ee_j = connectivity_mat.row
    return ee_i, ee_j
This approach relies on a predefined ordering of the nodes: the first columns correspond to the first population of selective neurons, and so on, with the final columns being the non-selective neurons.
This approach took about 0.25 seconds for your parameters on my machine, compared to more than five minutes for your original code.
As far as I could see by googling, np.concatenate is the fastest way to concatenate arrays, so I'd keep that.
But talking about code organization, I suggest you create a function, something like this (it may have syntax errors, so it's just to give you an idea):
def randomChoice(value, select):
    if select:
        return np.random.choice(encoders[value, :], int(c * f * N_E), False)
    else:
        return np.random.choice(non_selec, int(c * (1 - f * p) * N_E), False)
and have a single, simple call instead of a huge block of code with two concatenate calls crammed onto one line:
ee_i = np.concatenate([ee_i, randomChoice(0, True), randomChoice(1, True), randomChoice(2, True), randomChoice(3, True), randomChoice(4, True), randomChoice(0, False)])
I hope this helps
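Taking that refactor one step further, the five numbered calls can be collapsed into a loop over the populations (my own sketch, using the question's parameters; one_neuron_sources is a hypothetical helper name):

```python
import numpy as np

N_E, p, f, c = 8000, 5, 0.1, 0.2
encoders = np.random.choice(np.arange(N_E), (p, int(f * N_E)), replace=False)
non_selec = np.setdiff1d(np.arange(N_E), encoders)

def one_neuron_sources():
    """Draw the sources feeding a single target neuron."""
    picks = [np.random.choice(encoders[k, :], int(c * f * N_E), replace=False)
             for k in range(p)]
    picks.append(np.random.choice(non_selec, int(c * (1 - f * p) * N_E),
                                  replace=False))
    return np.concatenate(picks)

sources = one_neuron_sources()
```

This way the code keeps working unchanged if the number of populations p changes.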
- Nice first answer. I edited your code so that it won't error when you try to run it due to incorrect tabbing. If you dislike it, feel free to rollback my edit. :) -- Commented May 30, 2016 at 3:11
- Thanks Joe, I'll take a look later, but I'm glad that you did it. -- awdk, May 30, 2016 at 3:13
- I agree -- nice contribution, awdk. However, I don't know that this will do much to improve speed. Maybe there's a more radical reorganization that would allow me to get the information I need faster. -- abcd, May 30, 2016 at 3:36
- @dbliss, it seems that Curt's answer addresses your question about running it faster (pre-allocating the array instead of creating a new one each loop iteration). -- awdk, May 30, 2016 at 11:33
- I just added a major update to my answer which improved speed even more. -- Curt F., May 30, 2016 at 19:10