30Chapter 1
In1999,AndreyLehmanwrotetoamathematiciancolleaguethathehad
the"pleasuretolearnthat‘Weisfeiler-Leman’wasknownandstillcaused
interest."
59
HedidnotlivetoseetheriseofGNNsbasedonhisworkof
fiftyyearsearlier.NordidGeorgeVl
̆
adu ̧tseetherealisationofhisideasin
chemoinformatics, many of which remained on paper during his lifetime.
1.3The ‘Erlangen Programme’ of Deep Learning
Ourhistorical overviewof thegeometricfoundationsof deeplearning hasnow
naturallybroughtustotheblueprintthatunderpins thisbook.TakingConvo-
lutional andGraphNeural Networksastwoprototypicalexamples, at thefirst
glancecompletelyunrelated, wefindseveralcommontraits.First,both operate
ondata(imagesinthecaseofCNNsormoleculesinthecaseofGNNs)thathas
someunderlyinggeometricdomain(respectively,agridoragraph).Second,
in bothcases thetaskshave anaturalnotion ofinvariance (e.g.tothe position
ofanobjectinimageclassification,or thenumberingof atomsina molecule
inchemicalpropertyprediction)thatcanbeformulatedthroughappropriate
symmetry group (translation in the former example and permutation in the lat-
ter). Third, bothCNNs and GNNsincorporate the respective symmetries asan
inductivebiasbymakingtheirlayersinteractappropriatelywiththeactionof
thesymmetrygroupontheinput. InCNNs,itcomesinthe formofconvolu-
tionallayerswhoseoutputtransformsinthesamewayastheinput(wewill
callthispropertytranslation-equivariance,whichisthesameassayingthat
convolution commuteswith theshiftoperator).InGNNs,it assumestheform
of a symmetricmessagepassing function thataggregates theneighbournodes
irrespectiveoftheirorder,andtheoveralloutputofamessagepassinglayer
transformsinthesamewaywiththepermutationoftheinput(permutation-
equivariant).Finally,insomearchitecturalinstances,thedataareprocessed
in a multi-scalefashion;thiscorrespondstopoolinginCNNs(uniformly sub-
sampling the grid thatunderliestheimage) orgraphcoarseningin sometypes
of GNNs.
Overall,thisappearstobeaverygeneralprinciplethatcanbeappliedto
abroadrangeofproblems,typesofdata,andarchitectures.Wewillusethe
GeometricDeepLearningBlueprinttoderivefromfirstprinciplessomeof
themostcommonandpopularneuralnetworkarchitectures(CNNs,GNNs,
LSTMs,DeepSets,Transformers),whichasoftodayconstitutethemajority
ofthedeeplearningtoolkit.AswewillseeinChapters5–9,alloftheabove
canbeobtainedbyanappropriatechoiceofthedomainandtheassociated
symmetry group.
FurtherextensionsoftheBlueprint,suchasincorporatingthesymmetries
ofthedatainadditiontothoseofthedomain,allowtoobtainanewclassof