Research Retrospectives: An interview with David Ha, Co-founder and CEO of Sakana AI

Google Research
5 Mar 202456:17

TLDRIn this research retrospectives interview, David Ha, co-founder and CEO of Sakana AI, discusses his journey from a managing director at Goldman Sachs to a research scientist at Google Brain. He shares his passion for creating interactive web visualizations and neural network agents, emphasizing the importance of visual tools in interpreting machine learning results. Ha also explores his interest in evolutionary strategies, their potential in complex systems, and the future of research presentation beyond traditional PDFs, advocating for a blend of online interactive elements with academic rigor.

Takeaways

  • ๐Ÿ˜€ David Ha, co-founder and CEO of Sakana AI, started his interactive web visualizations as a hobby, combining his background in engineering with his fascination for AI.
  • ๐ŸŒ He emphasizes the importance of visual tools in communicating research findings, especially in the browser, and how it aids in interpreting machine learning results beyond just numbers.
  • ๐ŸŽจ David's early projects involved creating neural network agents in the browser, including a clone of a slime volleyball game controlled by these agents, highlighting the joy of visual and interactive learning.
  • ๐Ÿ“ˆ His work on evolving neural networks to generate images and fake Chinese characters reflects the evolution of AI from being new and exciting to becoming more integrated and expected in the field.
  • ๐Ÿ”ฌ David's transition from finance to research was driven by a desire for a change and an opportunity with Google's Brain Residency program, showing that career paths can be diverse and influenced by personal interests and opportunities.
  • ๐Ÿค– He discusses the potential of neuroevolution, particularly in reinforcement learning, where evolution strategies can offer simplicity and scalability in solving complex problems.
  • ๐ŸŒŸ The interview touches on the significance of open-source tools like EvoJax, which can help different research communities collaborate more effectively by providing a common programming language.
  • ๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘งโ€๐Ÿ‘ฆ David shares insights on the influence of having children on his research vision, noting how observing their play and exploration can inspire new ideas in AI.
  • ๐Ÿ•’ Time management as a researcher with a family involves finding moments of quiet and utilizing tools like Audible to multitask and stay informed.
  • ๐ŸŒ Reflecting on living and working in different cultures, David appreciates the openness of Canada and the US, the cultural diversity of the UK, and the focus that can come from working in more isolated environments like Japan.
  • ๐Ÿ” Looking ahead, David sees complex systems as a key area for future breakthroughs in AI, suggesting that understanding these systems could lead to more adaptive and resilient machine learning models.

Q & A

  • What is David Ha's background before he joined Google?

    -David Ha was a research scientist at Google working in the Brain team in Japan. Prior to Google, he worked at Goldman Sachs as a managing director where he ran the fixed income trading business in Japan. He has undergraduate and graduate degrees in engineering, science, and applied math from the University of Toronto.

  • How did David Ha's interest in interactive web visualizations begin?

    -David Ha's interest in interactive web visualizations began as a hobby around six or seven years ago. He was bored with his previous job and wanted to do some programming. He learned a visual programming language called processing which worked with JavaScript, and started creating simple neural network experiments and hosting them on a blog.

  • Can you describe one of David Ha's early projects with neural networks?

    -One of David Ha's early projects involved creating a clone of a slime volleyball game from his high school days, but with a twist - it was controlled using neural network agents. He also experimented with evolution, creating slime agents that evolved to avoid killer walls and planks.

  • What was the significance of David Ha's work on generating fake Chinese characters using recurrent neural networks?

    -David Ha's work on generating fake Chinese characters using recurrent neural networks was significant at the time (around seven years ago) because it was a novel application of machine learning that got people excited. Although such techniques may seem trivial now, they were innovative and demonstrated the potential of machine learning for creative tasks.

  • How does David Ha view the role of visualization in communicating research findings?

    -David Ha views visualization as an important discipline for communicating research findings. He believes that visualization, particularly using web browsers, can help interpret machine learning results more effectively than just looking at numbers. By interacting with models and playing with the simulations, one can gain insights that might not be apparent from performance metrics alone.

  • What is David Ha's perspective on the relationship between machine learning and evolutionary strategies?

    -David Ha appreciates both machine learning and evolutionary strategies for their unique strengths. He finds evolutionary strategies particularly useful for simulating things at different levels of complexity without having to define a rigid system. He also sees evolutionary strategies as a source of creativity and a way to define and solve novel problems that traditional machine learning might not address as effectively.

  • How did David Ha's work in finance influence his later research interests?

    -David Ha's work in finance, particularly in derivatives markets and macro trading, gave him a background in quantitative analysis and systems thinking. However, he found the financial markets to be chaotic and unpredictable, which led him to seek out more creative and less predictable fields like AI and machine learning.

  • What inspired David Ha to leave finance and pursue research at Google?

    -David Ha left finance due to the stress and monotony of the industry's performance-driven culture. He was interested in pursuing something new and creative, which led him to apply for and join the Google Brain residency program, allowing him to explore his interests in AI and machine learning.

  • Can you explain David Ha's use of Twitter for professional purposes?

    -David Ha uses Twitter as a platform to share his work, engage with the machine learning community, and discover new research. He finds it valuable for compressing ideas into concise messages, organizing thoughts into threads, and promoting his research to a broader audience.

  • What is David Ha's view on the future of academic publishing and the role of interactive web pages?

    -David Ha believes that while traditional PDF papers are not going away, there is a growing trend towards using interactive web pages as a primary medium for presenting research. He sees these pages as supplementary or even as the main material, with PDFs potentially becoming supplementary. He also notes the importance of adapting to new tools and platforms for research communication.

  • How does David Ha approach formulating his research vision and agenda?

    -David Ha formulates his research vision and agenda by setting a long-term thematic vision, such as exploring complex systems, and developing short-term strategies to achieve it. He looks for tasks where existing methods do not perform well and aims to develop new machine learning algorithms that can make progress in these areas.

Outlines

00:00

๐ŸŽจ Introduction and Personal Hobby to AI Visualization

The speaker, David, introduces himself as a research scientist at Google, with a background in complex systems and machine learning. He shares his journey from managing director at Goldman Sachs to exploring AI as a hobby, which led to creating interactive web visualizations. His initial projects involved neural networks and chaotic systems, which he hosted on a personal blog. David's work gained recognition, leading to an exhibition in Korea and his transition from finance to research.

05:00

๐ŸŒ The Evolution of Web Visualizations in Research

David discusses the importance of visualization in research, emphasizing its role in interpreting machine learning results beyond mere numbers. He highlights the shift from static PDFs to interactive web demos and the insights gained from engaging with models. David also shares his experience with neural network agents in games and the integration of evolutionary strategies in his work, leading to publications and a deeper understanding of model capabilities.

10:02

๐Ÿค– Exploration of Evolutionary Algorithms and Machine Learning

The conversation delves into David's fascination with evolutionary strategies, which he sees as a way to approach complex systems with less rigidity than traditional machine learning. He talks about his early experiments with neural network morphology and the influence of his work on neural architecture search. David also explores the generative aspects of machine learning, combining evolutionary concepts with generative models like GANs.

15:02

๐Ÿ”ฌ The Intersection of Evolutionary Strategies and Reinforcement Learning

David shares his views on the potential of evolutionary strategies in reinforcement learning, discussing their simplicity and scalability. He contrasts this with the challenges of credit assignment in RL and the focus on sample efficiency. His interest lies in solving novel problems rather than refining existing solutions, which he believes evolutionary strategies can facilitate effectively.

20:05

๐ŸŒ The Role of Communities in Advancing Research

The discussion highlights the separation and potential collaboration between the neuroevolution, machine learning, and RL communities. David notes the efforts of individuals like Jeff Clune and Ken Stanley in bridging these gaps. He also mentions the importance of conferences like NeurIPS and the role of Google in publishing research that combines evolutionary methods with machine learning.

25:05

๐Ÿ› ๏ธ The Impact of Tools on Research and Collaboration

David talks about 'EvoJax', a library of evolution algorithms implemented in Jax, and its potential to accelerate research pipelines. He emphasizes the importance of having a common programming language to facilitate collaboration across different research communities. The conversation also touches on the challenges of using GPUs for evolutionary algorithms and the opportunities presented by frameworks like Jax.

30:07

๐Ÿฆ Transition from Finance to Research and Lessons Learned

The speaker reflects on his transition from a career in finance to research, driven by a desire for change and the opportunity to explore his interests in AI. He discusses the lessons he learned from the finance industry, including the unpredictability of chaotic systems and the importance of adaptability. David's move to Google Brain as part of the residency program marked a significant shift in his professional journey.

35:08

๐Ÿฆ The Influence of Social Media on Research Dissemination

David shares his thoughts on the role of Twitter in academic research, highlighting its value for sharing ideas and engaging with the scientific community. He contrasts this with other social media platforms and discusses the importance of compressing complex ideas into concise, impactful messages. The conversation also explores the potential of alternative platforms like TikTok for research communication.

40:10

๐ŸŽญ The Future of Academic Publishing and Research Communication

The discussion envisions the future of academic publishing, with a focus on the role of online platforms and visualizations. David suggests that the traditional PDF may become supplementary as interactive web pages become more prevalent. He emphasizes the importance of adapting to new tools for research communication while acknowledging the continued relevance of well-written papers.

45:11

๐ŸŒ Experiences with Living and Researching in Different Cultures

David shares his experiences living in various countries and the impact on his research perspective. He reflects on the openness and cultural diversity of Canada and the UK, the concentrated innovation environment in Japan, and the food and lifestyle in Asia. The conversation also touches on the influence of cultural context on research and the benefits of a global outlook.

50:12

๐Ÿ‘ถ Drawing Inspiration from Family and Everyday Life

The speaker discusses how observing his children play and explore the world has influenced his approach to research, particularly in the area of AI. He highlights the importance of exploration and adaptation in learning, which he sees as a valuable strategy for research. David also shares his thoughts on time management and the challenges of balancing family life with research commitments.

55:14

๐Ÿ—บ๏ธ Formulating a Research Vision and Agenda

David talks about the importance of having a long-term research vision and short-term strategies to achieve it. He shares his interest in complex systems and how he looks for research topics that challenge existing methods. The conversation explores the idea of pursuing novel solutions and the 'anti state-of-the-art' approach to research that focuses on innovation rather than performance metrics.

๐Ÿ” The Potential of Complex Systems in Advancing AI

In the final part of the conversation, David speculates on the future of AI and the potential for breakthroughs from the study of complex systems. He discusses the hierarchical nature of complex systems and the need for AI to move beyond rigid structures to more adaptive and interactive models, drawing parallels with biological and social systems.

Mindmap

Keywords

๐Ÿ’กResearch Scientist

A research scientist is an individual who engages in the discovery and development of new knowledge in a specific scientific field. In the context of the video, David Ha is described as a research scientist at Google, working on complex systems and creative applications of machine learning. His role involves exploring and experimenting with AI, which is central to the video's theme of innovation and scientific exploration.

๐Ÿ’กMachine Learning

Machine learning is a subset of artificial intelligence that allows computers to learn and improve from experience without being explicitly programmed. In the script, David Ha's research includes creative applications of machine learning, such as developing neural network agents in the browser, which demonstrates the practical application of this concept in creating interactive web visualizations.

๐Ÿ’กNeural Network

A neural network is a series of algorithms designed to recognize patterns and process complex data inputs, inspired by the human brain's neural connections. The script mentions David Ha's early projects involving simple neural network experiments, showcasing how these networks can be trained for tasks like controlling a chaotic double pendulum or generating fake Chinese characters.

๐Ÿ’กSelf-Organization

Self-organization refers to the ability of a system to organize itself without external direction, often seen in complex systems and biological processes. David Ha's research in self-organization is highlighted in the script, indicating his interest in how systems can spontaneously form structures or behaviors, which is a key concept in understanding complex systems.

๐Ÿ’กWeb Visualizations

Web visualizations are graphical representations of data or concepts displayed through a web browser. The script discusses David Ha's interactive web visualizations, such as neural network agents and slime volleyball games, which he created to make machine learning concepts more accessible and engaging for a broader audience.

๐Ÿ’กEvolution Strategies

Evolution strategies are a type of black-box optimization algorithm inspired by the process of natural selection. In the video, David Ha expresses his interest in evolution strategies for their ability to solve complex problems without the need to define a rigid system or pipeline, contrasting them with gradient-based methods in machine learning.

๐Ÿ’กInteractive Demos

Interactive demos are applications or programs that allow users to manipulate variables and see immediate effects, providing a hands-on learning experience. The script notes David Ha's creation of interactive demos to communicate research findings, emphasizing the importance of interaction in understanding and discovering new insights from models.

๐Ÿ’กComplex Systems

Complex systems are composed of many interacting parts, often with the ability to self-organize and display emergent properties. David Ha's fascination with complex systems is a recurring theme in the script, as he discusses his research and the potential for these systems to inspire new approaches in machine learning and AI.

๐Ÿ’กNeuroevolution

Neuroevolution is the combination of neural networks and evolutionary algorithms to optimize network structures and weights. The script touches on David Ha's work in neuroevolution, particularly his experiments with CPPNs (Compositional Pattern Producing Networks) for generating high-resolution images, demonstrating the convergence of evolutionary strategies and neural networks.

๐Ÿ’กReinforcement Learning

Reinforcement learning is a type of machine learning where an agent learns to make decisions by taking actions in an environment to maximize some notion of cumulative reward. The script contrasts reinforcement learning with evolution strategies, highlighting David Ha's perspective on the different applications and advantages of each approach in solving problems.

๐Ÿ’กEvoJax

EvoJax is a library that implements state-of-the-art evolution algorithms in Jax, a machine learning framework. The script mentions EvoJax as an example of how tools can accelerate research and enable collaboration between different communities, such as machine learning and evolutionary computation.

Highlights

David Ha's journey from finance to research at Google Brain, highlighting his unique path into AI.

The importance of visual programming and its role in David's early AI projects, including his interactive web visualizations.

David's fascination with neural networks and machine learning, which led him to create engaging browser-based AI experiments.

His innovative approach to using neural networks to control games and simulate complex systems, such as the double pendulum.

The creation of slime volleyball using neural network agents, demonstrating the potential for AI in gaming.

David's exploration of evolutionary strategies in AI and their advantages over traditional machine learning methods.

The significance of visualization in communicating research findings and its impact on the understanding of AI models.

How David's work with neural architecture search influenced the field and contributed to his hiring at Google.

The role of interactive experiments in discovering new insights about AI models and their behavior.

David's perspective on the balance between traditional research papers and modern, interactive presentations of research.

His thoughts on the future of research communication, including the potential of platforms like Twitter and social media.

Insights from David's experience in neuroevolution and its relationship with machine learning and AI.

The development of EvoJax, a library of state-of-the-art evolution algorithms implemented in Jax, and its potential to foster collaboration.

David's views on the integration of complex systems theory into neural networks as a path to future breakthroughs in AI.

His personal approach to finding research topics, focusing on areas where current methods fall short.

The impact of David's work on weight-agnostic neural networks and its contribution to the field of AI.

Reflections on the role of family and personal life in shaping David's research vision and time management strategies.