Publications
2024
- [Nat. Comm.] Spontaneous emergence of rudimentary music detectors in deep neural networks. Gwangsu Kim, Dong-Kyum Kim, and Hawoong Jeong. Nature Communications, 2024.
Music exists in almost every society, has universal acoustic features, and is processed by distinct neural circuits in humans even with no experience of musical training. However, it remains unclear how these innate characteristics emerge and what functions they serve. Here, using an artificial deep neural network that models the auditory information processing of the brain, we show that units tuned to music can spontaneously emerge by learning natural sound detection, even without learning music. The music-selective units encoded the temporal structure of music in multiple timescales, following the population-level response characteristics observed in the brain. We found that the process of generalization is critical for the emergence of music-selectivity and that music-selectivity can work as a functional basis for the generalization of natural sound, thereby elucidating its origin. These findings suggest that evolutionary adaptation to process natural sounds can provide an initial blueprint for our sense of music.
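A minimal sketch of how unit-level music-selectivity could be screened for in such a network, assuming unit activations to music clips and to other natural sounds are available as arrays; the d'-style index and the threshold below are illustrative choices, not necessarily the metric used in the paper.

```python
import numpy as np

def music_selectivity_index(resp_music, resp_other):
    """d'-style selectivity: positive values indicate stronger mean
    responses to music than to other natural sounds."""
    mu_m, mu_o = resp_music.mean(axis=0), resp_other.mean(axis=0)
    pooled_sd = np.sqrt(0.5 * (resp_music.var(axis=0) + resp_other.var(axis=0))) + 1e-8
    return (mu_m - mu_o) / pooled_sd

# Toy usage: 200 music clips, 300 other natural-sound clips, 64 hidden units.
rng = np.random.default_rng(0)
resp_music = rng.normal(1.0, 1.0, size=(200, 64))
resp_other = rng.normal(0.0, 1.0, size=(300, 64))
msi = music_selectivity_index(resp_music, resp_other)
music_selective_units = np.where(msi > 1.0)[0]  # arbitrary threshold for illustration
print(len(music_selective_units))
```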
- [IJCV] SUBTLE: An Unsupervised Platform with Temporal Link Embedding that Maps Animal Behavior. Jea Kwon, Sunpil Kim, Dong-Kyum Kim, Jinhyeong Joo, SoHyung Kim, Meeyoung Cha, and C. Justin Lee. International Journal of Computer Vision, 2024.
While huge strides have recently been made in language-based machine learning, the ability of artificial systems to comprehend the sequences that comprise animal behavior has been lagging behind. In contrast, humans instinctively recognize behaviors by finding similarities in behavioral sequences. Here, we develop an unsupervised behavior-mapping framework, SUBTLE (spectrogram-UMAP-based temporal-link embedding), to capture comparable behavioral repertoires from 3D action skeletons. To find the best embedding method, we devise a temporal proximity index (TPI) as a new metric to gauge temporal representation in the behavioral embedding space. The method achieves the best TPI score compared to current embedding strategies. Its spectrogram-based UMAP clustering not only identifies subtle inter-group differences but also matches human-annotated labels. The SUBTLE framework automates both the identification of behavioral repertoires, such as walking, grooming, standing, and rearing, and the profiling of individual behavior signatures, such as inter-group differences by age. SUBTLE highlights the importance of temporal representation in the behavioral embedding space for human-like behavioral categorization.
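A toy illustration of the intuition behind a temporal proximity score: frames that are adjacent in time should stay close in the embedding space. The ratio below (mean distance between random frame pairs over mean distance between consecutive frames) is an assumed stand-in for illustration; the TPI defined in the paper differs in detail.

```python
import numpy as np

def temporal_proximity_score(embedding, n_random=10000, seed=0):
    """Toy proxy for temporal proximity: ratio of the mean distance
    between random frame pairs to the mean distance between temporally
    adjacent frames (higher = temporal structure better preserved)."""
    rng = np.random.default_rng(seed)
    adjacent = np.linalg.norm(embedding[1:] - embedding[:-1], axis=1).mean()
    i = rng.integers(0, len(embedding), n_random)
    j = rng.integers(0, len(embedding), n_random)
    random_pairs = np.linalg.norm(embedding[i] - embedding[j], axis=1).mean()
    return random_pairs / (adjacent + 1e-8)

# Usage: embedding is an (n_frames, d) array, e.g. a UMAP projection of spectrograms.
embedding = np.cumsum(np.random.default_rng(1).normal(size=(5000, 2)), axis=0)
print(temporal_proximity_score(embedding))
```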
2023
- [NeurIPS] Transformer as a hippocampal memory consolidation model based on NMDAR-inspired nonlinearity. Dong-Kyum Kim, Jea Kwon, Meeyoung Cha, and C. Justin Lee. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
The hippocampus plays a critical role in learning, memory, and spatial representation, processes that depend on the NMDA receptor (NMDAR). Here we build on recent findings comparing deep learning models to the hippocampus and develop a new nonlinear activation function based on NMDAR dynamics. We find that NMDAR-like nonlinearity is essential for shifting short-term working memory into long-term reference memory in transformers, thus enhancing a process that resembles memory consolidation in the mammalian brain. We design a navigation task assessing these two memory functions and show that manipulating the activation function (i.e., mimicking the Mg^2+-gating of NMDAR) disrupts long-term memory processes. Our experiments suggest that place cell-like functions and reference memory reside in the feed-forward network layer of transformers and that nonlinearity drives these processes. We discuss the role of NMDAR-like nonlinearity in establishing this striking resemblance between transformer architecture and hippocampal spatial representation.
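A minimal PyTorch sketch of an NMDAR-inspired gated activation, assuming a SiLU-style form x·sigmoid(beta·(x − alpha)) in which alpha loosely plays the role of the Mg2+ block; the exact parameterization used in the paper may differ.

```python
import torch
import torch.nn as nn

class NMDAGate(nn.Module):
    """Illustrative NMDAR-like activation: x * sigmoid(beta * (x - alpha)).
    alpha loosely mimics the Mg2+ gating threshold (alpha = 0, beta = 1
    recovers plain SiLU). The exact form in the paper may differ."""
    def __init__(self, alpha: float = 0.0, beta: float = 1.0):
        super().__init__()
        self.alpha = alpha
        self.beta = beta

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x * torch.sigmoid(self.beta * (x - self.alpha))

# Usage: drop-in activation for a transformer feed-forward block.
ffn = nn.Sequential(nn.Linear(512, 2048), NMDAGate(alpha=1.0), nn.Linear(2048, 512))
out = ffn(torch.randn(4, 16, 512))
print(out.shape)
```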
- [PRR] Multidimensional entropic bound: Estimator of entropy production for Langevin dynamics with an arbitrary time-dependent protocol. Sangyun Lee, Dong-Kyum Kim, Jong-Min Park, Won Kyu Kim, Hyunggyu Park, and Jae Sung Lee. Phys. Rev. Research, Mar 2023.
Entropy production (EP) is a key quantity in thermodynamics, and yet measuring EP has remained a challenging task. Here we introduce an EP estimator, called multidimensional entropic bound (MEB), utilizing an ensemble of trajectories. The MEB can accurately estimate the EP of overdamped Langevin systems with an arbitrary time-dependent protocol. Moreover, it provides a unified platform to accurately estimate the EP of underdamped Langevin systems under certain conditions. In addition, the MEB is computationally efficient because optimization is unnecessary. We apply our developed estimator to three physical systems driven by time-dependent protocols pertaining to experiments using optical tweezers: A dragged Brownian particle, a pulling process of a harmonic chain, and an unfolding process of an RNA hairpin. Numerical simulations confirm the validity and efficiency of our method.
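As a rough illustration of the shape of such ensemble-based estimators, the sketch below evaluates a multidimensional thermodynamic-uncertainty-relation-style lower bound from a vector of empirical currents; this is a stand-in for illustration only, not the MEB expression derived in the paper.

```python
import numpy as np

def tur_style_bound(currents):
    """Lower bound on entropy production from empirical currents:
    sigma >= 2 * <J>^T Xi^{-1} <J>, where Xi is the covariance matrix
    of the current vector J over the trajectory ensemble. This is the
    multidimensional TUR, shown only to illustrate the general shape
    of ensemble-based bounds; the MEB itself is defined differently."""
    mean_j = currents.mean(axis=0)          # shape (d,)
    cov_j = np.cov(currents, rowvar=False)  # shape (d, d)
    return 2.0 * mean_j @ np.linalg.solve(cov_j, mean_j)

# Toy usage: 10,000 trajectories with 3 currents measured on each.
rng = np.random.default_rng(0)
currents = rng.normal(loc=[0.5, 0.1, -0.2], scale=1.0, size=(10_000, 3))
print(tur_style_bound(currents))
```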
- [BigComp] Neural Classification of Terrestrial Biomes. Vyacheslav Shen, Dong-Kyum Kim, Elke Zeller, and Meeyoung Cha. In IEEE International Conference on Big Data and Smart Computing, Mar 2023.
Predicting vegetation changes under climate change is crucial because such changes will alter the distribution of different plants and have repercussions for ecosystems. To detect changes in vegetation, we employ biome classification, which assigns vegetation distributions to specific biomes. Conventional methods have used empirical formulas or simple vegetation models. Building on previous research that demonstrated the use of convolutional neural networks (CNNs), this work employs multiple deep models to classify biomes with the goal of predicting future changes. Experiments over multiple datasets demonstrate that Transformer models can be a suitable alternative to the CNN model. In addition, we observe that the use of additional climate variables, which previous studies have not considered, helps improve the prediction accuracy without overfitting the data. We discuss the future directions of machine learning for biome classification as a complement to traditional biome classification methods.
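A minimal sketch of the kind of classification setup described here, assuming stacked gridded climate variables as input channels and a handful of biome classes as targets; the channel count, grid size, class count, and the simple CNN backbone are all placeholders (the paper also evaluates Transformer models).

```python
import torch
import torch.nn as nn

# Illustrative setup: each sample is a stack of gridded climate variables
# (e.g., temperature and precipitation fields) and the target is one of
# n_biomes classes. All shapes and counts below are placeholders.
n_climate_vars, grid_h, grid_w, n_biomes = 5, 64, 64, 9

cnn_classifier = nn.Sequential(
    nn.Conv2d(n_climate_vars, 32, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(64, n_biomes),
)

x = torch.randn(8, n_climate_vars, grid_h, grid_w)
logits = cnn_classifier(x)
print(logits.shape)  # (8, n_biomes)
```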
2022
- [NeurIPS-W] Transformer needs NMDA receptor nonlinearity for long-term memory. Dong-Kyum Kim, Jea Kwon, Meeyoung Cha, and C. Justin Lee. In NeurIPS 2022 Memory in Artificial and Real Intelligence Workshop, 2022.
The NMDA receptor (NMDAR) in the hippocampus is essential for learning and memory. We find an interesting resemblance between deep models’ nonlinear activation functions and the NMDAR’s nonlinear dynamics. In light of a recent study that compared the transformer architecture to the formation of hippocampal memory, this paper presents new findings that NMDAR-like nonlinearity may be essential for consolidating short-term working memory into long-term reference memory. We design a navigation task assessing these two memory functions and show that manipulating the activation function (i.e., mimicking the Mg^2+-gating of NMDAR) disrupts long-term memory formation. Our experimental data suggest that the concept of place cells and reference memory may reside in the feed-forward network and that nonlinearity plays a key role in these processes. Our findings suggest that the transformer architecture and hippocampal spatial representation resemble each other because they share the concept of NMDAR-like nonlinearity.
- [PRR] Estimating entropy production with odd-parity state variables via machine learning. Dong-Kyum Kim, Sangyun Lee, and Hawoong Jeong. Phys. Rev. Research, Apr 2022.
Entropy production (EP) is a central measure in nonequilibrium thermodynamics, as it can quantify the irreversibility of a process as well as its energy dissipation in special cases. Using the time-reversal asymmetry in a system’s path probability distribution, many methods have been developed to estimate EP from trajectory data alone. However, for systems with odd-parity variables, which are prevalent in nonequilibrium settings, EP estimation via machine learning has not yet been addressed. In this study, we develop a machine-learning method for estimating the EP in a stochastic system with odd-parity variables through multiple neural networks, which enables us to measure EP with only trajectory data and parity information. We demonstrate our method with two systems, an underdamped bead-spring model and a one-particle odd-parity Markov jump process.
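For context, the trajectory-level EP targeted by such estimators can be written, for a steady state, as the log ratio of path probabilities in which the reverse path runs backward in time and flips the sign of odd-parity variables such as velocity; this is a standard relation in stochastic thermodynamics, not a formula specific to this paper.

```latex
\Delta S_{\mathrm{tot}}[\Gamma] = \ln\frac{\mathcal{P}[\Gamma]}{\mathcal{P}[\tilde{\Gamma}]},
\qquad \tilde{\Gamma}(t) = \varepsilon\,\Gamma(\tau - t),
\qquad \varepsilon:(x, v) \mapsto (x, -v).
```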
- [PRR] Inferring dissipation maps from videos using convolutional neural networks. Youngkyoung Bae, Dong-Kyum Kim, and Hawoong Jeong. Phys. Rev. Research, Aug 2022.
In the study of living organisms at mesoscopic scales, attaining a measure of dissipation or entropy production (EP) is essential to gain an understanding of their nonequilibrium dynamics. However, when tracking the relevant variables is impractical, it is challenging to figure out where and to what extent dissipation occurs from recorded time-series images from experiments. In this paper we develop an estimator that can, without detailed knowledge of the given systems, quantify the stochastic EP and produce a spatiotemporal pattern of the EP (or dissipation map) from videos through an unsupervised learning algorithm. Applying a convolutional neural network (CNN), our estimator allows us to visualize where the dissipation occurs as well as its time evolution in a video by looking at an attention map of the CNN’s last layer. We demonstrate that our estimator accurately measures the stochastic EP and provides a locally heterogeneous dissipation map, which is mainly concentrated in the origins of a nonequilibrium state, from generated Brownian videos of various models. We further confirm high performance even with noisy, low-spatial-resolution data and partially observed situations. Our method will provide a practical way to obtain dissipation maps and ultimately contribute to uncovering the source and the dissipation mechanisms of complex nonequilibrium phenomena.
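A minimal sketch of a fully convolutional estimator that maps a pair of consecutive frames to a per-pixel output and antisymmetrizes it over time order, which is one way such a spatiotemporal dissipation map could be parameterized; the architecture and the NEEP-style training scheme implied here are assumptions, not the exact ones in the paper.

```python
import torch
import torch.nn as nn

class DissipationMapCNN(nn.Module):
    """Illustrative fully convolutional estimator: takes a pair of
    consecutive frames and returns a per-pixel map that is antisymmetric
    in time order, h(x_t, x_{t+1}) - h(x_{t+1}, x_t), a candidate local
    entropy-production (dissipation) map under a NEEP-style objective."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(2, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, 3, padding=1),
        )

    def forward(self, frame_t, frame_next):
        forward_map = self.net(torch.cat([frame_t, frame_next], dim=1))
        backward_map = self.net(torch.cat([frame_next, frame_t], dim=1))
        return forward_map - backward_map  # antisymmetric in time

frames = torch.rand(4, 1, 64, 64), torch.rand(4, 1, 64, 64)
print(DissipationMapCNN()(*frames).shape)  # (4, 1, 64, 64) dissipation maps
```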
2021
- [PRR] Deep reinforcement learning for feedback control in a collective flashing ratchet. Dong-Kyum Kim and Hawoong Jeong. Phys. Rev. Research, Apr 2021.
A collective flashing ratchet transports Brownian particles using a spatially periodic, asymmetric, and time-dependent on-off switchable potential. The net current of the particles in this system can be substantially increased by feedback control based on the particle positions. Several feedback policies for maximizing the current have been proposed, but optimal policies have not been found for a moderate number of particles. Here, we use deep reinforcement learning (RL) to find optimal policies, with results showing that policies built with a suitable neural network architecture outperform the previous policies. Moreover, even in a time-delayed feedback situation where the on-off switching of the potential is delayed, we demonstrate that the policies provided by deep RL provide higher currents than the previous strategies.
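A minimal sketch of a feedback policy network for this setting, mapping particle positions to the probability of switching the potential on; the architecture, state encoding, and the simple stochastic on/off decision are placeholders, and training would proceed with a standard deep RL method using the particle current as the reward.

```python
import torch
import torch.nn as nn

class SwitchPolicy(nn.Module):
    """Illustrative feedback policy: maps the N particle positions
    (folded into one spatial period) to the probability of switching
    the ratchet potential on. Architecture and state encoding are
    placeholders, not the exact ones used in the paper."""
    def __init__(self, n_particles: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(n_particles, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, positions: torch.Tensor) -> torch.Tensor:
        return torch.sigmoid(self.net(positions))  # P(potential on)

policy = SwitchPolicy(n_particles=4)
positions = torch.rand(32, 4)       # batch of states
p_on = policy(positions)
action = torch.bernoulli(p_on)      # stochastic on/off decision per state
print(action.shape)
```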
2020
- [PRL] Learning Entropy Production via Neural Networks. Dong-Kyum Kim, Youngkyoung Bae, Sangyun Lee, and Hawoong Jeong. Phys. Rev. Lett., Oct 2020.
This Letter presents the neural estimator for entropy production (NEEP), which estimates entropy production (EP) from trajectories of relevant variables without detailed information on the system dynamics. For steady states, we rigorously prove that the estimator, which can be built up from different choices of deep neural networks, provides the stochastic EP by optimizing the objective function proposed here. We verify the NEEP with the stochastic processes of the bead-spring and discrete flashing ratchet models and also demonstrate that our method is applicable to high-dimensional data and can provide coarse-grained EP for Markov systems with unobservable states.
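A condensed sketch of the NEEP training step: the network output is antisymmetrized over the time order of each transition and the objective E[ΔS − exp(−ΔS)] is maximized, so that at the optimum ΔS matches the stochastic EP per transition; the network size and state dimension below are illustrative.

```python
import torch
import torch.nn as nn

# Condensed NEEP sketch: antisymmetrize the network output over the time
# order of a transition and maximize E[dS - exp(-dS)]; at the optimum dS
# matches the stochastic entropy production per transition.
state_dim = 2
h = nn.Sequential(nn.Linear(2 * state_dim, 128), nn.ReLU(), nn.Linear(128, 1))

def neep_delta_s(s_t, s_next):
    forward = h(torch.cat([s_t, s_next], dim=-1))
    backward = h(torch.cat([s_next, s_t], dim=-1))
    return (forward - backward).squeeze(-1)   # antisymmetric in time order

s_t, s_next = torch.randn(256, state_dim), torch.randn(256, state_dim)
delta_s = neep_delta_s(s_t, s_next)
objective = (delta_s - torch.exp(-delta_s)).mean()
loss = -objective   # maximize the objective by minimizing its negative
loss.backward()
```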
- [JKPS] Multi-label classification of historical documents by using hierarchical attention networks. Dong-Kyum Kim, Byunghwee Lee, Daniel Kim, and Hawoong Jeong. Journal of the Korean Physical Society, Oct 2020.
The quantitative analysis of digitized historical documents has begun in earnest in recent years. Text classification is of particular importance for quantitative historical analysis because it helps to search literature efficiently and to determine the important subjects of a particular age. While numerous historians have joined together to classify large-scale historical documents, consistent classification among individual researchers has not been achieved. In this study, we present a classification method for large-scale historical data that uses a recently developed supervised learning algorithm called the Hierarchical Attention Network (HAN). By applying various classification methods to the Annals of the Joseon Dynasty (AJD), we show that HAN is more accurate than conventional techniques with word-frequency-based features. HAN quantifies the extent to which a particular sentence or word contributes to the classification through a value called 'attention'. We extract the representative keywords from various categories by using the attention mechanism and show the evolution of the keywords over the 472-year span of the AJD. Our results reveal two broad groups of event categories in the AJD. In one group, the representative keywords of the categories remained stable over long periods, while in the other group the keywords varied rapidly, reflecting the repeatedly changing character of those categories. Observing such macroscopic changes of representative words may provide insight into how a particular topic changes over a historical period.
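A minimal sketch of a hierarchical attention network of the kind described here (a word-level encoder with attention builds sentence vectors, and a sentence-level encoder with attention builds a document vector), with the attention weights exposing how much each word or sentence contributes to the classification; vocabulary size, dimensions, and label count are placeholders.

```python
import torch
import torch.nn as nn

class AttentionPool(nn.Module):
    """Additive attention pooling; the returned weights indicate how much
    each element (word or sentence) contributes to the pooled vector."""
    def __init__(self, dim):
        super().__init__()
        self.proj = nn.Linear(dim, dim)
        self.context = nn.Linear(dim, 1, bias=False)

    def forward(self, x):                                  # x: (batch, seq, dim)
        scores = self.context(torch.tanh(self.proj(x)))    # (batch, seq, 1)
        weights = torch.softmax(scores, dim=1)
        return (weights * x).sum(dim=1), weights.squeeze(-1)

class TinyHAN(nn.Module):
    """Minimal hierarchical attention network sketch for multi-label
    classification; apply a sigmoid to the output logits for labels."""
    def __init__(self, vocab=10000, emb=64, hid=64, n_labels=20):
        super().__init__()
        self.embed = nn.Embedding(vocab, emb)
        self.word_gru = nn.GRU(emb, hid, batch_first=True, bidirectional=True)
        self.word_attn = AttentionPool(2 * hid)
        self.sent_gru = nn.GRU(2 * hid, hid, batch_first=True, bidirectional=True)
        self.sent_attn = AttentionPool(2 * hid)
        self.out = nn.Linear(2 * hid, n_labels)

    def forward(self, docs):                               # docs: (batch, n_sents, n_words)
        b, s, w = docs.shape
        words, _ = self.word_gru(self.embed(docs.view(b * s, w)))
        sent_vecs, word_w = self.word_attn(words)          # word-level attention
        sents, _ = self.sent_gru(sent_vecs.view(b, s, -1))
        doc_vec, sent_w = self.sent_attn(sents)            # sentence-level attention
        return self.out(doc_vec), word_w.view(b, s, w), sent_w

logits, word_weights, sent_weights = TinyHAN()(torch.randint(0, 10000, (2, 5, 12)))
print(logits.shape, word_weights.shape, sent_weights.shape)
```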