## datasets

### language

- EEG
  - [Brennan & Hale, 2019](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0207741): 33 subjects recorded with EEG, listening to 12 min of a book chapter, no repeated session
  - [Broderick et al. 2018](https://www.cell.com/current-biology/pdf/S0960-9822(18)30146-5.pdf): 9–33 subjects recorded with EEG, conducting different speech tasks, no repeated sessions
- fMRI
  - A natural language fMRI dataset for voxelwise encoding models ([lebel, ... huth, 2022](https://www.biorxiv.org/content/10.1101/2022.09.22.509104v1.abstract?%3Fcollection=))
    - 8 participants listening to ~6 hours each of the moth radio hour
    - 3 of the participants have ~20 hours (~95 stories, 33k timepoints)
  - [Schoffelen et al. 2019](https://www.nature.com/articles/s41597-019-0020-y): 100 subjects recorded with fMRI and MEG, listening to de-contextualised sentences and word lists, no repeated session
  - [Huth et al. 2016](https://www.nature.com/articles/nature17637) released data from [one subject](https://github.com/HuthLab/speechmodeltutorial)
  - Visual and linguistic semantic representations are aligned at the border of human visual cortex ([popham, huth et al. 2021](https://www.nature.com/articles/s41593-021-00921-6#data-availability)) - compared semantic maps obtained from two functional magnetic resonance imaging experiments in the same participants: one that used silent movies as stimuli and another that used narrative stories ([data link](https://berkeley.app.box.com/s/l95gie5xtv56zocsgugmb7fs12nujpog))
- MEG datasets
  - MEG-MASC ([gwilliams...king, 2023](https://www.nature.com/articles/s41597-023-02752-5)) - 27 English-speaking subjects recorded with MEG, each for ~2 hours of story listening, punctuated by random word lists and comprehension questions in the scanner; usually each subject listened to four distinct fictional stories twice
  - WU-Minn human connectome project ([van Essen et al. 2013](https://www.nature.com/articles/s41597-022-01382-7)) - 72 subjects recorded with fMRI and MEG as part of the Human Connectome Project, listening to 10 minutes of short stories, no repeated session
  - [Armeni et al. 2022](https://www.nature.com/articles/s41597-022-01382-7): 3 subjects recorded with MEG, listening to 10 hours of Sherlock Holmes, no repeated session
- ECoG
  - The "Podcast" ECoG dataset for modeling neural activity during natural language comprehension ([zada...hasson, 2025](https://www.biorxiv.org/content/10.1101/2025.02.14.638352v1.full.pdf)) - 9 subjects listening to the same story
    - 30-min story (1330 total electrodes, ~5000 spoken (non-unique) words) with a female interviewer/voiceover and a male speaker, occasionally with background music
    - contextual word embeddings from GPT-2 XL (middle layer) accounted for most of the variance across nearly all the electrodes tested
  - Brain Treebank: Large-scale intracranial recordings from naturalistic language stimuli ([wang...barbu, 2024](https://arxiv.org/pdf/2411.08343))
    - some works on this dataset
      - BrainBERT: Self-supervised representation learning for intracranial recordings ([wang...barbu, 2023](https://arxiv.org/abs/2302.14367))
      - Revealing Vision-Language Integration in the Brain with Multimodal Networks ([subramaniam...barbu, 2024](https://arxiv.org/abs/2406.14481))
      - Population Transformer: Learning Population-Level Representations of Neural Activity ([chau...barbu, 2024](https://pmc.ncbi.nlm.nih.gov/articles/PMC11177958/))
- stringer et al. data
  - 10000 neurons from visual cortex
- EEG + fMRI simultaneous
  - NeuroBOLT data ([li...chang, 2024](https://arxiv.org/abs/2410.05341); code [link](https://drive.google.com/file/d/1s9LzdBx1afGYiGbpi3p-oFh-CnYLyYqM/view?usp=sharing))
  - An open-access dataset of naturalistic viewing using simultaneous EEG-fMRI ([telesford...franco, 2023](https://www.nature.com/articles/s41597-023-02458-8))
- neuropixels probes
  - [10k neurons visual coding](https://portal.brain-map.org/explore/circuits/visual-coding-neuropixels) from the allen institute
  - this probe has also been used in [macaques](https://www.cell.com/neuron/pdf/S0896-6273(19)30428-3.pdf)
- EEG foundation model: Learning Topology-Agnostic EEG Representations with Geometry-Aware Modeling ([yi...dongsheng li, 2023](https://openreview.net/pdf?id=hiOUySN0ub))
- Strong Prediction: Language Model Surprisal Explains Multiple N400 Effects ([michaelov...coulson, 2024](https://direct.mit.edu/nol/article/5/1/107/115605/Strong-Prediction-Language-Model-Surprisal))

## cross-subject modeling

- Aligning Brains into a Shared Space Improves their Alignment to Large Language Models ([bhattacharjee, zaida..., hasson, goldstein, nastase, 2024](https://www.biorxiv.org/content/10.1101/2024.06.04.597448v1))
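shared-space alignment of this flavor can be sketched as a Procrustes map between two subjects' responses to a common stimulus — a toy simulation, not the paper's actual pipeline (dimensions, noise levels, and the train/test split are all made-up assumptions):

```python
import numpy as np
from scipy.linalg import orthogonal_procrustes

rng = np.random.default_rng(0)

# toy setup: two subjects receive the same stimulus; subject 2's responses
# live in a rotated version of subject 1's 20-dim response space
T, D = 500, 20
shared = rng.standard_normal((T, D))               # shared stimulus-driven signal
R = np.linalg.qr(rng.standard_normal((D, D)))[0]   # subject 2's unknown rotation
subj1 = shared + 0.1 * rng.standard_normal((T, D))
subj2 = shared @ R + 0.1 * rng.standard_normal((T, D))

# fit an orthogonal map from subject 2's space to subject 1's on "training runs"
Q, _ = orthogonal_procrustes(subj2[:400], subj1[:400])

def mean_corr(A, B):
    """Mean per-dimension correlation between two response matrices."""
    return np.mean([np.corrcoef(A[:, i], B[:, i])[0, 1] for i in range(D)])

# alignment recovers cross-subject correspondence on held-out timepoints
print(f"unaligned: {mean_corr(subj2[400:], subj1[400:]):.2f}")
print(f"aligned:   {mean_corr(subj2[400:] @ Q, subj1[400:]):.2f}")
```

once subjects are in a shared space, a single encoding/decoding model can be fit across them, which is the motivation in the paper above.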
# fMRI

## language
- Seminal language-semantics fMRI study ([huth...gallant, 2016](https://www.nature.com/articles/nature17637)) - build mapping of semantic concepts across cortex using word vecs
- Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions ([benara et al. 2024](https://openreview.net/pdf?id=mxMvWwyBWe))
- A generative framework to bridge data-driven models and scientific theories in language neuroscience ([antonello et al. 2024](https://arxiv.org/abs/2410.00812))
- Explanations of Deep Language Models Explain Language Representations in the Brain ([rahimi...daliri, 2025](https://arxiv.org/pdf/2502.14671)) - build features using attribution methods and find some small perf. improvements in early language areas
- Deep language algorithms predict semantic comprehension from brain activity ([caucheteux, gramfort, & king, facebook, 2022](https://www.nature.com/articles/s41598-022-20460-9)) - predicts fMRI with gpt-2 on the narratives dataset
  - GPT-2 representations predict fMRI response + the extent to which subjects understand the corresponding narratives
  - compared different encoding features: phoneme, word, gpt-2 layers, gpt-2 attention sizes
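a minimal sketch of the voxelwise encoding recipe these papers share: build time-lagged stimulus features (a crude stand-in for the hemodynamic response), fit ridge regression per voxel, and score held-out correlation. All data below are synthetic and the dimensions are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

# toy dimensions: 1000 fMRI timepoints, 50-dim stimulus features, 200 voxels
T, F, V = 1000, 50, 200
features = rng.standard_normal((T, F))        # e.g. embeddings resampled to TRs
true_w = rng.standard_normal((F * 4, V)) * 0.1

def make_lagged(X, n_lags=4):
    """Stack delayed copies of the features to model the hemodynamic lag."""
    lagged = [np.roll(X, lag, axis=0) for lag in range(1, n_lags + 1)]
    for lag, Xl in enumerate(lagged, start=1):
        Xl[:lag] = 0  # zero out wrapped-around rows
    return np.hstack(lagged)

X = make_lagged(features)
Y = X @ true_w + 0.5 * rng.standard_normal((T, V))  # synthetic BOLD responses

# contiguous train/test split in time (as with held-out stories)
X_tr, X_te, Y_tr, Y_te = X[:800], X[800:], Y[:800], Y[800:]

# ridge regression, one weight map per voxel (closed form)
alpha = 10.0
W = np.linalg.solve(X_tr.T @ X_tr + alpha * np.eye(X.shape[1]), X_tr.T @ Y_tr)

# evaluate: correlation between predicted and actual response, per voxel
pred = X_te @ W
corr = [np.corrcoef(pred[:, v], Y_te[:, v])[0, 1] for v in range(V)]
print(f"mean held-out correlation: {np.mean(corr):.2f}")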
- use encoding models to sort thousands of sentences and then show them
  - alternatively, use gradient-based modifications to transform a random sentence to elicit larger responses, but this works worse
  - surprisal and well-formedness of linguistic input are key determinants of response strength in the language network
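surprisal here is just −log p(word | context); the studies above compute it from a language model, but a toy add-one-smoothed bigram model on a made-up corpus shows the quantity itself:

```python
import math
from collections import Counter

# made-up corpus, only to illustrate surprisal = -log2 p(word | context);
# the papers above use LLM probabilities instead of bigram counts
corpus = ("the dog chased the cat . the dog bit the mailman . "
          "the cat chased the mouse .").split()

bigrams = Counter(zip(corpus, corpus[1:]))
unigrams = Counter(corpus)

def surprisal(context, word):
    """-log2 p(word | context) under an add-one-smoothed bigram model."""
    vocab = len(unigrams)
    p = (bigrams[(context, word)] + 1) / (unigrams[context] + vocab)
    return -math.log2(p)

# a predictable continuation is less surprising than an unexpected one
print(surprisal("the", "dog"), surprisal("the", "mailman"))
```

higher surprisal (less predictable input) is what drives larger responses in the language network per the bullet above.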
- Bilingual language processing relies on shared semantic representations that are modulated by each language ([chen...klein, gallant, deniz, 2024](https://www.biorxiv.org/content/10.1101/2024.06.24.600505v1))
- An investigation across 45 languages and 12 language families reveals a universal language network ([malik-moraleda...fedorenko, 2022](https://www.nature.com/articles/s41593-022-01114-5#data-availability))
- Multilingual Computational Models Reveal Shared Brain Responses to 21 Languages ([gregor de varda, malik-moraleda...tuckute, fedorenko, 2025](https://www.biorxiv.org/content/10.1101/2025.02.01.636044v1))
- Constructed languages are processed by the same brain mechanisms as natural languages ([malik-moraleda...fedorenko, 2023](https://www.biorxiv.org/content/10.1101/2023.07.28.550667v2))
## semantic decoding

- duality between encoding and decoding (e.g. for probing something like syntax in an LLM)
  - esp. when things are localized, as in fMRI
- Interpreting encoding and decoding models ([kriegeskorte & douglas, 2019](https://arxiv.org/pdf/1812.00278.pdf))
- Encoding and decoding in fMRI ([naselaris, kay, nishimoto, & gallant, 2011](https://www.sciencedirect.com/science/article/abs/pii/S1053811910010657))
- Causal interpretation rules for encoding and decoding models in neuroimaging ([weichwald...grosse-wentrup, 2015](https://www.sciencedirect.com/science/article/abs/pii/S105381191500052X))
  - the experimental setup can be **stimulus-based**, if the experimental conditions precede the measured brain states (e.g. podcast listening), or **response-based** (e.g. predicting the laterality of a movement from pre-movement brain-state features)
  - for stimulus-based experiments, the encoding model is the causal direction
- language
  - Semantic reconstruction of continuous language from non-invasive brain recordings ([tang, lebel, jain, & huth, 2023](https://www.nature.com/articles/s41593-023-01304-9)) - reconstruct continuous natural language from fMRI, extending to imagined speech
  - Brain-to-Text Decoding: A Non-invasive Approach via Typing ([levy...king, 2025](https://scontent.fphl1-1.fna.fbcdn.net/v/t39.2365-6/475464888_600710912891423_9108680259802499048_n.pdf?_nc_cat=102&ccb=1-7&_nc_sid=3c67a6&_nc_ohc=EryvneL7DMcQ7kNvgFI6M7D&_nc_oc=Adi15_Ln_aPZ_nUY7RyiXzmEzdKu0opFDIwv3J7P55siQ-yn-FUdKQ6_H6PZBKiwBiY&_nc_zt=14&_nc_ht=scontent.fphl1-1.fna&_nc_gid=A441zcs56M0HTpo4ZEEWBSk&oh=00_AYAZ7fX4RhYWqMu2aMria3GoOB6uMNIiIciUQzU0vXy3Tw&oe=67AC0C96)) - decode typed characters from MEG/EEG
  - From Thought to Action: How a Hierarchy of Neural Dynamics Supports Language Production ([zhang, levy, ...king, 2025](https://ai.meta.com/research/publications/from-thought-to-action-how-a-hierarchy-of-neural-dynamics-supports-language-production/)) - when decoding during typing, first decode the phrase, then the word, then the syllable, then the letter
- Decoding the Semantic Content of Natural Movies from Human Brain Activity ([huth...gallant, 2016](https://www.frontiersin.org/journals/systems-neuroscience/articles/10.3389/fnsys.2016.00081/full)) - direct decoding of concepts from movies using hierarchical logistic regression
  - interpreting weights from a decoding model can be tricky: even if a concept is reflected in a voxel, it may not be *uniquely* reflected in that voxel and may therefore be assigned a low weight
- Reconstructing Visual Experiences from Brain Activity Evoked by Natural Movies ([nishimoto, ..., gallant, 2011](https://www.sciencedirect.com/science/article/pii/S0960982211009377))
- Brain Decoding: Toward Real Time Reconstruction of Visual Perception ([benchetrit...king, 2023](https://ai.meta.com/static-resource/image-decoding)) - use MEG to do visual reconstruction
- Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding ([chen et al. 2022](https://arxiv.org/pdf/2211.06956.pdf))
- Aligning brain functions boosts the decoding of visual semantics in novel subjects ([thual...king, 2023](https://arxiv.org/abs/2312.06467)) - align across subjects before decoding
- A variational autoencoder provides novel, data-driven features that explain functional brain representations in a naturalistic navigation task ([cho, zhang, & gallant, 2023](https://jov.arvojournals.org/article.aspx?articleid=2792546))
- What's the Opposite of a Face? Finding Shared Decodable Concepts and their Negations in the Brain ([efird...fyshe, 2024](https://arxiv.org/abs/2405.17663)) - build clustering shared across subjects in CLIP space
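the encoding/decoding duality can be made concrete on synthetic data: for a stimulus-based design, the encoding model (stimulus → brain) is fit in the causal direction, while the decoding model (brain → stimulus) runs anti-causally; mechanically, both are just regressions in opposite directions. Dimensions and noise levels below are arbitrary assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

# toy stimulus-based design: stimulus features causally drive voxel responses
T, F, V = 600, 10, 100
S = rng.standard_normal((T, F))          # stimulus features
W_true = rng.standard_normal((F, V)) * 0.3
B = S @ W_true + rng.standard_normal((T, V))   # brain responses + noise

def ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression from X to Y."""
    return np.linalg.solve(X.T @ X + alpha * np.eye(X.shape[1]), X.T @ Y)

S_tr, S_te, B_tr, B_te = S[:400], S[400:], B[:400], B[400:]

W_enc = ridge(S_tr, B_tr)  # encoding: stimulus -> brain (causal direction here)
W_dec = ridge(B_tr, S_tr)  # decoding: brain -> stimulus (anti-causal direction)

def mean_corr(pred, true):
    return np.mean([np.corrcoef(pred[:, i], true[:, i])[0, 1]
                    for i in range(true.shape[1])])

print(f"encoding r: {mean_corr(S_te @ W_enc, B_te):.2f}")
print(f"decoding r: {mean_corr(B_te @ W_dec, S_te):.2f}")
```

note that good decoding performance says nothing per se about which voxels carry the concept — the weight-interpretation caveat from the bullet above.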
## theories of explanation
- NeuroQuery, comprehensive meta-analysis of human brain mapping ([dockes, poldrack, ..., yarkoni, suchanek, thirion, & varoquaux](https://elifesciences.org/articles/53385)) [[website](https://neuroquery.org/query?text=checkerboard)]
  - train on keywords to directly predict weights for each query-expanded keyword, then produce a linearly combined brain map
-
## vision
1091
-
1092
-
-[Reconstructing Visual Experiences from Brain Activity Evoked by Natural Movies](https://www.sciencedirect.com/science/article/pii/S0960982211009377) (nishimoto, ..., gallant, 2011)
1093
-
- Brain Decoding: Toward Real Time Reconstruction of Visual Perception ([Benchetrit...king, 2023](https://ai.meta.com/static-resource/image-decoding)) - use MEG to do visual reconstruction
1094
-
- Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding ([chen et al. 2022](https://arxiv.org/pdf/2211.06956.pdf))
1095
-
- Aligning brain functions boosts the decoding of visual semantics in novel subjects ([thual...king, 2023](https://arxiv.org/abs/2312.06467)) - align across subjects before doing decoding
1096
-
- A variational autoencoder provides novel, data-driven features that explain functional brain representations in a naturalistic navigation task ([cho, zhang, & gallant, 2023](https://jov.arvojournals.org/article.aspx?articleid=2792546))
1097
-
- What's the Opposite of a Face? Finding Shared Decodable Concepts and their Negations in the Brain ([efird...fyshe, 2024](https://arxiv.org/abs/2405.17663)) - build clustering shared across subjects in CLIP space
1098
-
1099
1122
## speech
- Improving semantic understanding in speech language models via brain-tuning ([moussa, klakow, & toneva, 2024](https://arxiv.org/abs/2410.09230))
- finding embeddings via DNNs is a special case of this (e.g. might call it "semantic hashing")
- [random projections in the brain](https://www.biorxiv.org/content/biorxiv/early/2017/08/25/180471.full.pdf) - doing locality-sensitive hashing (basically nearest neighbors)
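the locality-sensitive-hashing idea can be sketched with sign random projections (the SimHash family); the fly-brain algorithm in the paper differs in its details (sparse expansion plus winner-take-all), so this is only the generic LSH version:

```python
import numpy as np

rng = np.random.default_rng(0)

def lsh_codes(X, n_bits=16, seed=0):
    """SimHash-style LSH: one sign bit per random hyperplane, so nearby
    vectors receive identical or nearly identical bit codes."""
    proj = np.random.default_rng(seed).standard_normal((X.shape[1], n_bits))
    return (X @ proj > 0).astype(int)

x = rng.standard_normal(100)
near = x + 0.01 * rng.standard_normal(100)  # small perturbation of x
far = rng.standard_normal(100)              # unrelated vector

codes = lsh_codes(np.stack([x, near, far]))
print("bits differing, near:", int((codes[0] != codes[1]).sum()))
print("bits differing, far: ", int((codes[0] != codes[2]).sum()))
```

hamming distance between codes then approximates angular distance between the original vectors, which is what makes the hash usable for fast nearest-neighbor lookup.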
---

**`_notes/research_ovws/ovw_interp.md`** (+2 −1 lines)
- Knowledge-enhanced Bottlenecks (KnoBo) - A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis ([yang...yatskar, 2024](https://yueyang1996.github.io/papers/knobo.pdf)) - CBMs that incorporate knowledge priors constraining them to reason with clinically relevant factors found in medical textbooks or PubMed
- Crafting Interpretable Embeddings by Asking LLMs Questions ([benara...gao, 2024](https://arxiv.org/pdf/2405.16714))
- Tree-Based Leakage Inspection and Control in Concept Bottleneck Models ([ragkousis & parbhoo, 2024](https://arxiv.org/abs/2410.06352)) - investigate where a soft version of a feature outperforms the hard version
- Bayesian Concept Bottleneck Models with LLM Priors ([feng...tan, 2024](https://arxiv.org/abs/2410.15555))
- Towards Achieving Concept Completeness for Unsupervised Textual Concept Bottleneck Models ([bhan...lesot, 2025](https://arxiv.org/abs/2502.11100)) - distill embeddings from a trained model
- Stochastic Concept Bottleneck Models ([vandenhirtz...vogt, 2024](https://arxiv.org/pdf/2406.19272)) - model covariance between concepts
- MoIE: Route, Interpret, Repeat: Blurring the Line Between Post hoc Explainability and Interpretable Models ([ghosh, ..., batmangehelich, 2023](https://arxiv.org/abs/2302.10289#)) - mixture of different interpretable models, with black-box routing