178Chapter 6
Whileingeneralthedomaincanevolveintimetogetherwiththesignals
onit,itistypicallyassumedthatthedomainiskeptfixedacrossallthet,i.e.
Ω
(t)
=Ω.Here,we will exclusively focus onthiscase, but note that exceptions
arecommon. Socialnetworks areanexample whereoneoftenhastoaccount
forthedomainchangingthroughtime,asnewlinksareregularlycreatedas
wellaserased.Thedomaininthissettingisoftenreferredtoasadynamic
graph (D. Xu et al. 2020; Rossi, Chamberlain, et al. 2020).
Often,theindividualX
(t)
inputswillexhibitusefulsymmetriesandhence
maybenontriviallytreatedbyanyofourpreviouslydiscussedarchitectures.
Somecommonexamplesinclude:videos(Ωisafixedgrid,andsignalsarea
sequence of frames);fMRI scans (Ωis a fixedmesh representing the geometry
of the brain cortex, where different regions are activated at different times as a
responsetopresentedstimuli);andtrafficflownetworks(Ωisafixedgraphrep-
resenting the road network, on which e.g. the average traffic speed is recorded
at various nodes).
Letusassumeanencoderfunctionf(X
(t)
)providinglatentrepresentations
atthelevelofgranularityappropriatefortheproblemandrespectfulofthe
symmetriesoftheinputdomain.Asanexample
17
,considerprocessingvideo
frames:thatis,ateachtimestep,wearegivenagrid-structuredinputrep-
resentedasann×ばつdmatrixX
(t)
,wherenisthenumberofpixels(fixedin
time) anddisthe numberofinputchannels(e.g.d=3forRGBframes). Fur-
ther,weareinterestedinanalysisatthelevelofentireframes,inwhichcase
itisappropriatetoimplementfasatranslationinvariantCNN,outputtinga
k-dimensional representation z
(t)
=f(X
(t)
) of the frame at time-step t.
Wearenowleftwiththetaskofappropriatelysummarisingasequenceof
vectorsz
(t)
acrossallthesteps.Acanonicalwaytodynamicallyaggregate
this informationin away thatrespects thetemporal progressionof inputsand
alsoeasilyallowsforonlinearrivalofnoveldata-points,isusingaRecurrent
Neural Network(RNN).
18
What wewillshow hereisthatRNNsareaninter-
esting geometric architecture tostudy in their own right,since they implement
a rather unusual type of symmetry over the inputs z
(t)
.
SimpleRNNsAteachstep,therecurrentneuralnetworkcomputesanm-
dimensionalsummaryvectorh
(t)
ofalltheinputstepsuptoandincludingt.
This(partial)summaryiscomputedconditionalonthe currentstep’sfeatures
andthepreviousstep’ssummary,throughasharedupdatefunction,R:R
k
×ばつ
R
m
→R
m
, as follows (see Figure 6.5 for a summary):
h
(t)
=R(z
(t)
,h
(t–1)
)(6.80)
and, asboth z
(t)
and h
(t–1)
are flat vector representations, Rmay bemost easily
expressedasasinglefully-connectedneuralnetworklayer(oftenknownas