I saw a paper where someone was able to initialize the hidden state of an RNN using a feed-forward NN. I was trying to figure out how this could be done, but I keep getting error messages while building the model. I have time-series data with at least 100 values per run for 2000 independent runs. I want the input to be used both to build the hidden state and as the input to the RNN itself.
Currently this is how I was trying to create the model:
from tensorflow.keras import layers, Model, Input
units = 200
N_inputs = 4
N_outputs = 10
inputs = Input(shape = (None, N_inputs))
state_init = layers.Dense(units)(inputs)
GRU_layer = layers.GRU(units = units, return_sequences = True)(inputs, initial_state = state_init)
outputs = layers.Dense(units = N_outputs)(GRU_layer)
model = Model(inputs, outputs)
I am getting this error:
ValueError: An 'initial_state' was passed that is not compatible with 'cell.state_size'. Received 'state_spec'=ListWrapper([InputSpec(shape=(None, None, 200), ndim=3)]); however 'cell.state_size' is [200]
Is this even possible or do I have to create some custom code for this? Any help would be greatly appreciated.
The paper is here: https://ieeexplore.ieee.org/document/7966138
- The state must not have a time dimension. Applying the Dense layer to the same sequence as the GRU doesn't make much sense. – xdurch0, Feb 27, 2024 at 7:46
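For reference, a minimal sketch of one way to follow that advice: collapse the time dimension before the Dense layer so the initial state has shape (batch_size, units). Using the first timestep here is only an illustration; the paper may build the state from different features.

from tensorflow.keras import layers, Model, Input

units = 200
N_inputs = 4
N_outputs = 10

inputs = Input(shape = (None, N_inputs))
# Collapse the time dimension: use only the first timestep (illustrative choice),
# giving shape (batch_size, N_inputs)
first_step = layers.Lambda(lambda x: x[:, 0, :])(inputs)
# Feed-forward network producing the initial hidden state, shape (batch_size, units)
state_init = layers.Dense(units, activation = "tanh")(first_step)
# The GRU still consumes the full sequence but starts from the learned state
GRU_layer = layers.GRU(units = units, return_sequences = True)(inputs, initial_state = state_init)
outputs = layers.Dense(units = N_outputs)(GRU_layer)
model = Model(inputs, outputs)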
- The error you're encountering arises because the initial_state argument in the GRU layer expects a tensor with shape (batch_size, units), or a nested list of tensors if the GRU layer is a bidirectional layer. However, in your code you are passing a tensor with shape (batch_size, None, units) as the initial_state. – Priya T, Apr 22, 2024 at 5:53
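A sketch of an alternative layout, assuming the initial state is built from separate per-run (static) features rather than from the sequence itself; the name static_inputs and the size N_static are illustrative, not taken from the question:

from tensorflow.keras import layers, Model, Input

units = 200
N_inputs = 4
N_outputs = 10
N_static = 8  # hypothetical number of per-run features

seq_inputs = Input(shape = (None, N_inputs), name = "sequence")
static_inputs = Input(shape = (N_static,), name = "static")  # shape (batch_size, N_static)
# Feed-forward initializer network -> (batch_size, units), matching cell.state_size
state_init = layers.Dense(units, activation = "tanh")(static_inputs)
GRU_layer = layers.GRU(units = units, return_sequences = True)(seq_inputs, initial_state = state_init)
outputs = layers.Dense(units = N_outputs)(GRU_layer)
model = Model([seq_inputs, static_inputs], outputs)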