Updated on 2024/04/16

写真a

 
HACHIYA Hirotaka
 
Name of department
Faculty of Systems Engineering, Intelligent Informatics
Job title
Associate Professor
Concurrent post
School of Socio-Informatics(Associate Professor)、Informatics Division(Associate Professor)
Mail Address
E-mail address
External link

Degree

  • 博士(工学)

Academic & Professional Experience

  • 2023.04
    -
    Now

    Wakayama   大学院システム工学研究科   Associate Professor

  • 2020.04
    -
    Now

    株式会社サイバーリンクス   技術アドバイザー

  • 2019.09
    -
    2020.03

    株式会社 サイバーリンクス   顧問

  • 2017.06
    -
    Now

    RIKEN   AIP   Visiting Researcher

  • 2017.04
    -
    2023.03

    Wakayama University   Graduate School of Systems Engineering   Lecturer

  • 2015.08
    -
    2017.03

    キヤノン株式会社   主任研究員

  • 2012.07
    -
    2015.07

    キヤノン株式会社   研究員

▼display all

Research Areas

  • Informatics / Intelligent informatics

Classes (including Experimental Classes, Seminars, Graduation Thesis Guidance, Graduation Research, and Topical Research)

  • 2022   Introduction to Artificial Intelligence   Cooperative Development Subjects

  • 2022   Invitation to Artificial Intelligence   Cooperative Development Subjects

  • 2022   Intelligent Informatics Seminar   Specialized Subjects

  • 2022   Graduation Research   Specialized Subjects

  • 2022   Exercises in fundamentals of artificial intelligence   Specialized Subjects

  • 2022   Artificial Intelligence   Specialized Subjects

  • 2022   Data Analysis   Specialized Subjects

  • 2021   Intelligent Informatics Seminar   Specialized Subjects

  • 2021   Exercises in fundamentals of artificial intelligence   Specialized Subjects

  • 2021   Artificial Intelligence   Specialized Subjects

  • 2021   Graduation Research   Specialized Subjects

  • 2021   Graduation Research   Specialized Subjects

  • 2021   Data Analysis   Specialized Subjects

  • 2021   Introductory Seminar in Systems Engineering   Specialized Subjects

  • 2021   Introduction to Artificial Intelligence   Cooperative Development Subjects

  • 2021   Invitation to Artificial Intelligence   Cooperative Development Subjects

  • 2020   Graduation Research   Specialized Subjects

  • 2020   Graduation Research   Specialized Subjects

  • 2020   Exercises in fundamentals of artificial intelligence   Specialized Subjects

  • 2020   Data Analysis   Specialized Subjects

  • 2020   Intelligent Informatics Seminar   Specialized Subjects

  • 2020   Artificial Intelligence   Specialized Subjects

  • 2020   Introduction to Artificial Intelligence   Cooperative Development Subjects

  • 2020   Invitation to Artificial Intelligence   Cooperative Development Subjects

  • 2019   Graduation Research   Specialized Subjects

  • 2019   Artificial Intelligence   Specialized Subjects

  • 2019   Data Analysis   Specialized Subjects

  • 2019   Intelligent Informatics Seminar   Specialized Subjects

  • 2019   Intelligent System Seminar   Specialized Subjects

  • 2018   Graduation Research   Specialized Subjects

  • 2018   Data Analysis   Specialized Subjects

  • 2018   Intelligent Informatics Seminar   Specialized Subjects

  • 2018   Intelligent System Seminar   Specialized Subjects

  • 2018   Introductory Seminar in Systems Engineering   Specialized Subjects

  • 2018   Artificial Intelligence   Specialized Subjects

  • 2017   Data Analysis   Specialized Subjects

  • 2017   Intelligent Informatics Seminar   Specialized Subjects

  • 2017   Intelligent System Seminar   Specialized Subjects

  • 2017   Artificial Intelligence   Specialized Subjects

▼display all

Classes

  • 2022   Systems Engineering Global Seminar Ⅱ   Doctoral Course

  • 2022   Systems Engineering Global Seminar Ⅰ   Doctoral Course

  • 2022   Systems Engineering Advanced Research   Doctoral Course

  • 2022   Systems Engineering Advanced Seminar Ⅱ   Doctoral Course

  • 2022   Systems Engineering Advanced Seminar Ⅰ   Doctoral Course

  • 2022   Systems Engineering Project SeminarⅡB   Master's Course

  • 2022   Systems Engineering Project SeminarⅡA   Master's Course

  • 2022   Systems Engineering Project SeminarⅠB   Master's Course

  • 2022   Systems Engineering Project SeminarⅠA   Master's Course

  • 2022   Systems Engineering SeminarⅡB   Master's Course

  • 2022   Systems Engineering SeminarⅡA   Master's Course

  • 2022   Systems Engineering SeminarⅠB   Master's Course

  • 2022   Systems Engineering SeminarⅠA   Master's Course

  • 2021   Systems Engineering Global Seminar Ⅱ   Doctoral Course

  • 2021   Systems Engineering Global Seminar Ⅰ   Doctoral Course

  • 2021   Systems Engineering Advanced Research   Doctoral Course

  • 2021   Systems Engineering Advanced Seminar Ⅱ   Doctoral Course

  • 2021   Systems Engineering Advanced Seminar Ⅰ   Doctoral Course

  • 2021   Systems Engineering Project SeminarⅡB   Master's Course

  • 2021   Systems Engineering Project SeminarⅡA   Master's Course

  • 2021   Systems Engineering Project SeminarⅠB   Master's Course

  • 2021   Systems Engineering Project SeminarⅠA   Master's Course

  • 2021   Systems Engineering SeminarⅡB   Master's Course

  • 2021   Systems Engineering SeminarⅡA   Master's Course

  • 2021   Systems Engineering SeminarⅠB   Master's Course

  • 2021   Systems Engineering SeminarⅠA   Master's Course

  • 2020   Systems Engineering Global Seminar Ⅱ   Doctoral Course

  • 2020   Systems Engineering Global Seminar Ⅰ   Doctoral Course

  • 2020   Systems Engineering Advanced Research   Doctoral Course

  • 2020   Systems Engineering Advanced Seminar Ⅱ   Doctoral Course

  • 2020   Systems Engineering Advanced Seminar Ⅰ   Doctoral Course

  • 2020   Systems Engineering Project SeminarⅡB   Master's Course

  • 2020   Systems Engineering Project SeminarⅡA   Master's Course

  • 2020   Systems Engineering Project SeminarⅠB   Master's Course

  • 2020   Systems Engineering Project SeminarⅠA   Master's Course

  • 2020   Systems Engineering SeminarⅡB   Master's Course

  • 2020   Systems Engineering SeminarⅡA   Master's Course

  • 2020   Systems Engineering SeminarⅠB   Master's Course

  • 2020   Systems Engineering SeminarⅠA   Master's Course

  • 2019   Systems Engineering Advanced Seminar Ⅰ   Doctoral Course

  • 2019   Systems Engineering Advanced Seminar Ⅰ   Doctoral Course

  • 2019   Systems Engineering Advanced Research   Doctoral Course

  • 2019   Systems Engineering Advanced Research   Doctoral Course

  • 2019   Systems Engineering SeminarⅡB   Master's Course

  • 2019   Systems Engineering SeminarⅡA   Master's Course

  • 2019   Systems Engineering SeminarⅠB   Master's Course

  • 2019   Systems Engineering SeminarⅠA   Master's Course

  • 2019   Systems Engineering Project SeminarⅡB   Master's Course

  • 2019   Systems Engineering Project SeminarⅡA   Master's Course

  • 2019   Systems Engineering Project SeminarⅠB   Master's Course

  • 2019   Systems Engineering Project SeminarⅠA   Master's Course

  • 2018   Systems Engineering Global Seminar Ⅱ   Doctoral Course

  • 2018   Systems Engineering Global Seminar Ⅱ   Doctoral Course

  • 2018   Systems Engineering Advanced Research   Doctoral Course

  • 2018   Systems Engineering Advanced Research   Doctoral Course

  • 2018   Systems Engineering Project SeminarⅡB   Master's Course

  • 2018   Systems Engineering Project SeminarⅡA   Master's Course

  • 2018   Systems Engineering Project SeminarⅠB   Master's Course

  • 2018   Systems Engineering Project SeminarⅠA   Master's Course

  • 2018   Systems Engineering SeminarⅡB   Master's Course

  • 2018   Systems Engineering SeminarⅡA   Master's Course

  • 2018   Systems Engineering SeminarⅠB   Master's Course

  • 2018   Systems Engineering SeminarⅠA   Master's Course

  • 2017   Systems Engineering Global Seminar Ⅱ   Doctoral Course

  • 2017   Systems Engineering Advanced Research   Doctoral Course

  • 2017   Systems Engineering Advanced Research   Doctoral Course

  • 2017   Systems Engineering Advanced Seminar Ⅱ   Doctoral Course

  • 2017   Systems Engineering Advanced Seminar Ⅱ   Doctoral Course

  • 2017   Systems Engineering Advanced Seminar Ⅰ   Doctoral Course

  • 2017   Systems Engineering Advanced Seminar Ⅰ   Doctoral Course

  • 2017   Systems Engineering Project SeminarⅡB   Master's Course

  • 2017   Systems Engineering Project SeminarⅡA   Master's Course

  • 2017   Systems Engineering Project SeminarⅠA   Master's Course

  • 2017   Systems Engineering SeminarⅡB   Master's Course

  • 2017   Systems Engineering SeminarⅡA   Master's Course

  • 2017   Systems Engineering SeminarⅠB   Master's Course

  • 2017   Systems Engineering SeminarⅠA   Master's Course

▼display all

Satellite Courses

  • 2020   Intelligent Information Communication System in Modern Society  

Published Papers

  • Set representative vector and its asymmetric attention-based transformation for heterogeneous set-to-set matching

    Hirotaka Hachiya, Yuki Saito (Part: Lead author )

    Neurocomputing     2024.04  [Refereed]

    DOI

  • Position-dependent partial convolutions for supervised spatial interpolation

    Hirotaka Hachiya, Kotaro Nagayoshi, Asako Iwaki, Takahiro Maeda, Naonori Ueda, Hiroyuki Fujiwara (Part: Lead author )

    Machine Learning with Applications ( Elsevier BV )    100514 - 100514   2023.11  [Refereed]

    DOI

  • Encoder–decoder-based image transformation approach for integrating multiple spatial forecasts

    Hirotaka Hachiya, Yusuke Masumoto, Atsushi Kudo, Naonori Ueda (Part: Lead author )

    Machine Learning with Applications ( Elsevier BV )  12 ( 100473 ) 1 - 11   2023.05  [Refereed]

    DOI

  • Multistream-Based Marked Point Process With Decomposed Cumulative Hazard Functions

    Hirotaka Hachiya, Sujun Hong (Part: Lead author )

    Neural Computation ( MIT Press )  35 ( 4 ) 699 - 726   2023.03  [Refereed]

     View Summary

    Abstract

    When applying a point process to a real-world problem, an appropriate intensity function model should be designed based on physical and mathematical prior knowledge. Recently, a fully trainable deep learning–based approach has been developed for temporal point processes. In this approach, a cumulative hazard function (CHF) capable of systematic computation of adaptive intensity function is modeled in a data-driven manner. However, in this approach, although many applications of point processes generate various kinds of information such as location, magnitude, and depth, the mark information of events is not considered. To overcome this limitation, we propose a fully trainable marked point process method for modeling decomposed CHFs for time and mark prediction using multistream deep neural networks. We demonstrate the effectiveness of the proposed method through experiments with synthetic and real-world event data.

    DOI

  • Multi‐feature subspace representation network for person re‐identification via bird's‐eye view image

    Jiwei Zhang, Haiyuan Wu, Qian Chen, Hirotaka Hachiya

    Computer Animation and Virtual Worlds ( Wiley )    2023.02  [Refereed]

    DOI

  • Simulation of broad-band ground motions with consistent long-period and short-period components using the Wasserstein interpolation of acceleration envelopes

    Tomohisa Okazaki, Hirotaka Hachiya, Asako Iwaki, Takahiro Maeda, Hiroyuki Fujiwara, Naonori Ueda

    Geophysical Journal International ( OXFORD UNIV PRESS )  227 ( 1 ) 333 - 349   2021.10  [Refereed]

     View Summary

    Practical hybrid approaches for the simulation of broad-band ground motions often combine long-period and short-period waveforms synthesized by independent methods under different assumptions for different period ranges, which at times can lead to incompatible time histories and frequency properties. This study explores an approach that generates consistent broad-hand waveforms using past observation records, under the assumption that long-period waveforms can he obtained from physics-based simulations. Specifically, acceleration envelopes and Fourier amplitude spectra are transformed from long-period to short-period using machine learning methods, and they arc combined to produce a broad-band waveform. To effectively obtain the relationship of high-dimensional envelopes from limited amount of data, we (I) l'onnulate the problem as the conversion of probability distributions, which enables the introduction of a metric known as the Wasserstein distance, and (2) embed pairs of longperiod and short-period envelopes into a common latent space to improve the consistency of the entire waveform. An experimental application to a past earthquake demonstrates that the proposed method exhibits superior performance compared to existing methods as well as neural network approaches. In particular, the proposed method reproduces global properties in the time domain, which confirms the effectiveness of the embedding approach as well as the advantage of the Wasserstein distance as a measure of dissimilarity of the envelopes. This method serves as a novel machine learning approach that maintains consistency both in the time-domain and frequency-domain properties of waveforms.

    DOI

  • 3D Faster R-CNNとレーザスキャンとの組み合わせによる特定物体の頑健な距離推定

    八谷大岳、射手矢和真、中村恭之

    計測自動制御学会論文集   55(1)   2019.01  [Refereed]

  • Distance estimation with 2.5D anchors and its application to robot navigation

    Hirotaka Hachiya, Yuki Saito, Kazuma Iteya, Masaya Nomura, Takayuki Nakamura

    ROBOMECH Journal ( Springer Nature )  5 ( 1 )   2018.12  [Refereed]

    DOI

  • Information-Maximization Clustering Based on Squared-Loss Mutual Information.

    Masashi Sugiyama, Gang Niu, Makoto Yamada, Manabu Kimura, Hirotaka Hachiya

    Neural Computation   26 ( 1 ) 84 - 131   2014  [Refereed]

    DOI

  • Computationally Efficient Multi-Label Classification by Least-Squares Probabilistic Classifiers

    Hyunha Nam, Hirotaka Hachiya, Masashi Sugiyama

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ( IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG )  E96D ( 8 ) 1871 - 1874   2013.08  [Refereed]

     View Summary

    Multi-label classification allows a sample to belong to multiple classes simultaneously, which is often the case in real-world applications such as text categorization and image annotation. In multi-label scenarios, taking into account correlations among multiple labels can boost the classification accuracy. However, this makes classifier training more challenging because handling multiple labels induces a high-dimensional optimization problem. In this paper, we propose a scalable multi-label method based on the least-squares probabilistic classifier. Through experiments, we show the usefulness of our proposed method.

    DOI

  • Feature Selection via l(1)-Penalized Squared-Loss Mutual Information

    Wittawat Jitkrittum, Hirotaka Hachiya, Masashi Sugiyama

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ( IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG )  E96D ( 7 ) 1513 - 1524   2013.07  [Refereed]

     View Summary

    Feature selection is a technique to screen out less important features. Many existing supervised feature selection algorithms use redundancy and relevancy as the main criteria to select features. However, feature interaction, potentially a key characteristic in real-world problems, has not received much attention. As an attempt to take feature interaction into account, we propose l(1)-LSMI, an l(1)-regularization based algorithm that maximizes a squared-loss variant of mutual information between selected features and outputs. Numerical results show that l(1)-LSMI performs well in handling redundancy, detecting non-linear dependency, and considering feature interaction.

    DOI

  • Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration

    Tingting Zhao, Hirotaka Hachiya, Voot Tangkaratt, Jun Morimoto, Masashi Sugiyama

    NEURAL COMPUTATION ( MIT PRESS )  25 ( 6 ) 1512 - 1547   2013.06  [Refereed]

     View Summary

    The policy gradient approach is a flexible and powerful reinforcement learning method particularly for problems with continuous actions such as robot control. A common challenge is how to reduce the variance of policy gradient estimates for reliable policy updates. In this letter, we combine the following three ideas and give a highly effective policy gradient method: (1) policy gradients with parameter-based exploration, a recently proposed policy search method with low variance of gradient estimates; (2) an importance sampling technique, which allows us to reuse previously gathered data in a consistent way; and (3) an optimal baseline, which minimizes the variance of gradient estimates with their unbiasedness being maintained. For the proposed method, we give a theoretical analysis of the variance of gradient estimates and show its usefulness through extensive experiments.

  • Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation in Oriental Ink Painting

    Ning Xie, Hirotaka Hachiya, Masashi Sugiyama

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ( IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG )  E96D ( 5 ) 1134 - 1144   2013.05  [Refereed]

     View Summary

    Oriental ink painting, called Sumi-e, is one of the most distinctive painting styles and has attracted artists around the world. Major challenges in Sumi-e simulation are to abstract complex scene information and reproduce smooth and natural brush strokes. To automatically generate such strokes, we propose to model the brush as a reinforcement learning agent, and let the agent learn the desired brush-trajectories by maximizing the sum of rewards in the policy search framework. To achieve better performance, we provide elaborate design of actions, states, and rewards specifically tailored for a Sumi-e agent. The effectiveness of our proposed approach is demonstrated through experiments on Sumi-e simulation.

    DOI

  • Relative Density-Ratio Estimation for Robust Distribution Comparison.

    Makoto Yamada, Taiji Suzuki, Takafumi Kanamori, Hirotaka Hachiya, Masashi Sugiyama

    Neural Computation   25 ( 5 ) 1324 - 1370   2013  [Refereed]

    DOI

  • Multi-Task Approach to Reinforcement Learning for Factored-State Markov Decision Problems

    Jaak Simm, Masashi Sugiyama, Hirotaka Hachiya

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ( IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG )  E95D ( 10 ) 2426 - 2437   2012.10  [Refereed]

     View Summary

    Reinforcement learning (RL) is a flexible framework for learning a decision rule in an unknown environment. However, a large number of samples are often required for finding a useful decision rule. To mitigate this problem, the concept of transfer learning has been employed to utilize knowledge obtained from similar RL tasks. However, most approaches developed so far are useful only in low-dimensional settings. In this paper, we propose a novel transfer learning idea that targets problems with high-dimensional states. Our idea is to transfer knowledge between state factors (e.g., interacting objects) within a single RL task. This allows the agent to learn the system dynamics of the target RL task with fewer data samples. The effectiveness of the proposed method is demonstrated through experiments.

    DOI

  • Importance-weighted least-squares probabilistic classifier for covariate shift adaptation with application to human activity recognition

    Hirotaka Hachiya, Masashi Sugiyama, Naonori Ueda

    NEUROCOMPUTING ( ELSEVIER SCIENCE BV )  80   93 - 101   2012.03  [Refereed]

     View Summary

    Human activity recognition from accelerometer data (e.g., obtained by smart phones) is gathering a great deal of attention since it can be used for various purposes such as remote health-care. However, since collecting labeled data is bothersome for new users, it is desirable to utilize data obtained from existing users. In this paper, we formulate this adaptation problem as learning under covariate shift, and propose a cornputationally efficient probabilistic classification method based on adaptive importance sampling. The usefulness of the proposed method is demonstrated in real-world human activity recognition. (C) 2011 Elsevier B.V. All rights reserved.

    DOI

  • Analysis and improvement of policy gradient estimation

    Tingting Zhao, Hirotaka Hachiya, Gang Niu, Masashi Sugiyama

    NEURAL NETWORKS ( PERGAMON-ELSEVIER SCIENCE LTD )  26   118 - 129   2012.02  [Refereed]

     View Summary

    Policy gradient is a useful model-free reinforcement learning approach, but it tends to suffer from instability of gradient estimates. In this paper, we analyze and improve the stability of policy gradient methods. We first prove that the variance of gradient estimates in the PGPE (policy gradients with parameter-based exploration) method is smaller than that of the classical REINFORCE method under a mild assumption. We then derive the optimal baseline for PGPE, which contributes to further reducing the variance. We also theoretically show that PGPE with the optimal baseline is more preferable than REINFORCE with the optimal baseline in terms of the variance of gradient estimates. Finally, we demonstrate the usefulness of the improved PGPE method through experiments. (c) 2011 Elsevier Ltd. All rights reserved.

    DOI

  • COMPUTATIONALLY EFFICIENT MULTI-LABEL CLASSIFICATION BY LEAST-SQUARES PROBABILISTIC CLASSIFIER

    Hyun Ha Nam, Hirotaka Hachiya, Masashi Sugiyama

    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) ( IEEE )    2077 - 2080   2012  [Refereed]

     View Summary

    Multi-label classification allows a sample to belong to multiple classes simultaneously, which is often the case in real-world applications such as audio tagging, image annotation, video search, and text mining. In such a multi-label scenario, taking into account correlation between multiple labels can boost the classification accuracy. However, this in turn makes classifier training more challenging because handling multiple labels tends to induce a high-dimensional optimization problem. In this paper, we propose a highly scalable multilabel classifier based on a computationally efficient classification algorithm called the least-squares probabilistic classifier. Through experiments, we show the usefulness of our proposed method.

  • Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning

    Hirotaka Hachiya, Jan Peters, Masashi Sugiyama

    NEURAL COMPUTATION ( MIT PRESS )  23 ( 11 ) 2798 - 2832   2011.11  [Refereed]

     View Summary

    Direct policy search is a promising reinforcement learning framework, in particular for controlling continuous, high-dimensional systems. Policy search often requires a large number of samples for obtaining a stable policy update estimator, and this is prohibitive when the sampling cost is expensive. In this letter, we extend an expectation-maximization-based policy search method so that previously collected samples can be efficiently reused. The usefulness of the proposed method, reward-weighted regression with sample reuse (R-3), is demonstrated through robot learning experiments. (This letter is an extended version of our earlier conference paper: Hachiya, Peters, & Sugiyama, 2009.)

    DOI

  • On information-maximization clustering: Tuning parameter selection and analytic solution

    Masashi Sugiyama, Makoto Yamada, Manabu Kimura, Hirotaka Hachiya

    Proceedings of the 28th International Conference on Machine Learning, ICML 2011     65 - 72   2011  [Refereed]

     View Summary

    Information-maximization clustering learns a probabilistic classifier in an unsupervised manner so that mutual information between feature vectors and cluster assignments is maximized. A notable advantage of this approach is that it only involves continuous optimization of model parameters, which is substantially easier to solve than discrete optimization of cluster assignments. However, existing methods still involve non-convex optimization problems, and therefore finding a good local optimal solution is not straightforward in practice. In this paper, we propose an alternative information-maximization clustering method based on a squared-loss variant of mutual information. This novel approach gives a clustering solution analytically in a computationally efficient way via kernel eigenvalue decomposition. Furthermore, we provide a practical model selection procedure that allows us to objectively optimize tuning parameters included in the kernel function. Through experiments, we demonstrate the usefulness of the proposed approach. Copyright 2011 by the author(s)/owner(s).

  • Least absolute policy iteration - A robust approach to value function approximation

    Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashima, Tetsuro Mortmura

    IEICE Transactions on Information and Systems   E93-D ( 9 ) 2555 - 2565   2010.09  [Refereed]

     View Summary

    Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational e?ciency. However, it tends to be sensitive to outliers in observed rewards. In this paper, we propose an alternative method that employs the absolute loss for enhancing robustness and reliability. The proposed method is formulated as a linear programming problem which can be solved eficiently by standard optimization software, so the computational advantage is not sacrificed for gaining robustness and reliability. We demonstrate the usefulness of the proposed approach through a simulated robot-control task. Copyright © 2010 The Institute of Electronics, Information and Communication Engineers.

    DOI

  • Least Absolute Policy Iteration-A Robust Approach to Value Function Approximation

    Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashima, Tetsuro Morimura

    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS ( IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG )  E93D ( 9 ) 2555 - 2565   2010.09  [Refereed]

     View Summary

    Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers in observed rewards. In this paper, we propose an alternative method that employs the absolute loss for enhancing robustness and reliability. The proposed method is formulated as a linear programming problem which can be solved efficiently by standard optimization software, so the computational advantage is not sacrificed for gaining robustness and reliability. We demonstrate the usefulness of the proposed approach through a simulated robot-control task.

    DOI

  • Efficient exploration through active learning for value function approximation in reinforcement learning

    Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiyama

    NEURAL NETWORKS ( PERGAMON-ELSEVIER SCIENCE LTD )  23 ( 5 ) 639 - 648   2010.06  [Refereed]

     View Summary

    Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares policy iteration (LSPI) framework allows us to employ statistical active learning methods for linear regression. Then we propose a design method of good sampling policies for efficient exploration, which is particularly useful when the sampling cost of immediate rewards is high. The effectiveness of the proposed method, which we call active policy iteration (API), is demonstrated through simulations with a batting robot. (C) 2010 Elsevier Ltd. All rights reserved.

    DOI

  • Parametric return density estimation for Reinforcement Learning

    Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashima, Hirotaka Hachiya, Toshiyuki Tanaka

    Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence, UAI 2010     368 - 375   2010  [Refereed]

     View Summary

    Most conventional Reinforcement Learning (RL) algorithms aim to optimize decision-making rules in terms of the expected re-turns. However, especially for risk management purposes, other risk-sensitive criteria such as the value-at-risk or the expected shortfall are sometimes preferred in real applications. Here, we describe a parametric method for estimating density of the returns, which allows us to handle various criteria in a unified manner. We first extend the Bellman equation for the conditional expected return to cover a conditional probability density of the returns. Then we derive an extension of the TD-learning algorithm for estimating the return densities in an unknown environment. As test instances, several parametric density estimation algorithms are presented for the Gaussian, Laplace, and skewed Laplace distributions. We show that these algorithms lead to risk-sensitive as well as robust RL paradigms through numerical experiments.

  • Least-squares conditional density estimation

    Masashi Sugiyama, Ichiro Takeuchi, Taiji Suzuki, Takafumi Kanamori, Hirotaka Hachiya, Daisuke Okanohara

    IEICE Transactions on Information and Systems   E93-D ( 3 ) 583 - 594   2010  [Refereed]

     View Summary

    Estimating the conditional mean of an input-output relation is the goal of regression. However, regression analysis is not sufficiently informative if the conditional distribution has multi-modality, is highly asymmetric, or contains heteroscedastic noise. In such scenarios, estimating the conditional distribution itself would be more useful. In this paper, we propose a novel method of conditional density estimation that is suitable for multi-dimensional continuous variables. The basic idea of the proposed method is to express the conditional density in terms of the density ratio and the ratio is directly estimated without going through density estimation. Experiments using benchmark and robot transition datasets illustrate the usefulness of the proposed approach. Copyright © 2010 The Institute of Electronics, Information and Communication Engineers.

    DOI

  • Nonparametric return distribution approximation for reinforcement learning

    Tetsuro Morimurat, Masashi Sugiyama, Hisashi Kashima, Hirotaka Hachiya, Toshiyuki Tanaka

    ICML 2010 - Proceedings, 27th International Conference on Machine Learning     799 - 806   2010  [Refereed]

     View Summary

    Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such as the expected shortfall are some-times preferred. Here, we describe a method of approximating the distribution of returns, which allows us to derive various kinds of information about the returns. We first show that the Bellman equation, which is a recursive formula for the expected return, can be extended to the cumulative return distribution. Then we derive a nonparametric return distribution estimator with particle smooth ing based on this extended Bellman equation. A key aspect of the proposed algorithm is to represent the recursion relation in the extended Bellman equation by a simple replacement procedure of particles associated with a state by using those of the successor state. We show that our algorithm leads to a risk-sensitive R.L paradigm. The usefulness of the proposed approach is demonstrated through numerical experiments. Copyright 2010 by the author(s)/owner(s).

  • Conditional density estimation via least-squares density ratio estimation

    Masashi Sugiyama, Ichiro Takeuchi, Taiji Suzuki, Takafumi Kanamori, Hirotaka Hachiya, Daisuke Okanohara

    Journal of Machine Learning Research   9   781 - 788   2010  [Refereed]

     View Summary

    Estimating the conditional mean of an inputoutput relation is the goal of regression. However, regression analysis is not sufficiently informative if the conditional distribution has multi-modality, is highly asymmetric, or contains heteroscedastic noise. In such scenarios, estimating the conditional distribution itself would be more useful. In this paper, we propose a novel method of conditional density estimation. Our basic idea is to express the conditional density in terms of the ratio of unconditional densities, and the ratio is directly estimated without going through density estimation. Experiments using benchmark and robot transition datasets illustrate the usefulness of the proposed approach. Copyright 2010 by the authors.

  • Feature Selection for Reinforcement Learning: Evaluating Implicit State-Reward Dependency via Conditional Mutual Information

    Hirotaka Hachiya, Masashi Sugiyama

    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I ( SPRINGER-VERLAG BERLIN )  6321   474 - 489   2010  [Refereed]

     View Summary

    Model-free reinforcement learning (RL) is a machine learning approach to decision making in unknown environments. However, real-world RL tasks often involve high-dimensional state spaces, and then standard RL methods do not perform well. In this paper, we propose a new feature selection framework for coping with high dimensionality. Our proposed framework adopts conditional mutual information between return and state-feature sequences as a feature selection criterion, allowing the evaluation of implicit state-reward dependency. The conditional mutual information is approximated by a least-squares method, which results in a computationally efficient feature selection procedure. The usefulness of the proposed method is demonstrated on grid-world navigation problems.

  • Adaptive importance sampling for value function approximation in off-policy reinforcement learning

    Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiayma, Jan Peters

    Neural Networks   22 ( 10 ) 1399 - 1410   2009.12  [Refereed]

     View Summary

    Off-policy reinforcement learning is aimed at efficiently using data samples gathered from a policy that is different from the currently optimized policy. A common approach is to use importance sampling techniques for compensating for the bias of value function estimators caused by the difference between the data-sampling policy and the target policy. However, existing off-policy methods often do not take the variance of the value function estimators explicitly into account and therefore their performance tends to be unstable. To cope with this problem, we propose using an adaptive importance sampling technique which allows us to actively control the trade-off between bias and variance. We further provide a method for optimally determining the trade-off parameter based on a variant of cross-validation. We demonstrate the usefulness of the proposed approach through simulations. © 2009 Elsevier Ltd. All rights reserved.

    DOI

  • Active Policy Iteration: Efficient Exploration through Active Learning for Value Function Approximation in Reinforcement Learning

    Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiyama

    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS ( IJCAI-INT JOINT CONF ARTIF INTELL )    980 - 985   2009  [Refereed]

     View Summary

    Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares policy iteration (LSPI) framework allows us to employ statistical active learning methods for linear regression. Then we propose a design method of good sampling policies for efficient exploration, which is particularly useful when the sampling cost of immediate rewards is high. We demonstrate the usefulness of the proposed method, named active policy iteration (API), through simulations with a batting robot.

  • Efficient Data Reuse in Value Function Approximation.

    Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters

    ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING ( IEEE )    8 - +   2009  [Refereed]

     View Summary

    Off-policy reinforcement learning is aimed at efficiently using data samples gathered from a policy that is different from the currently optimized policy. A common approach is to use importance sampling techniques for compensating for the bias of value function estimators caused by the difference between the data-sampling policy and the target policy. However, existing off-policy methods often do not take the variance of the value function estimators explicitly into account and therefore their performance tends to be unstable. To cope with this problem, we propose using an adaptive importance sampling technique which allows us to actively control the trade-off between bias and variance. We further provide a method for optimally determining the trade-off parameter based on a variant of cross-validation. The usefulness of the proposed approach is demonstrated through simulated swing-up inverted-pendulum problem.

  • Efficient Sample Reuse in EM-Based Policy Search

    Hirotaka Hachiya, Jan Peters, Masashi Sugiyama

    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I ( SPRINGER-VERLAG BERLIN )  5781   469 - +   2009  [Refereed]

     View Summary

    Direct policy search is a, promising reinforcement learning framework in particular for controlling in continuous, high-dimensional systems such as anthropomorphic robots. Policy search often requires a large number of samples for obtaining a stable policy update estimator due to its high flexibility. However, this is prohibitive when the sampling cost is expensive. Ill this paper, we extend all EM-based policy search method so that previously collected samples call be efficiently reused. The usefulness of the proposed method. called Reward-weighted Regression with. sample Reuse (R-3), is demonstrated through a robot learning experiment.

  • Least Absolute Policy Iteration for Robust Value Function Approximation

    Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashima, Tetsuro Morimura

    ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7 ( IEEE )    699 - +   2009  [Refereed]

     View Summary

    Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers in observed rewards. In this paper, we propose an alternative method that employs the absolute loss for enhancing robustness and reliability. The proposed method is formulated as a linear programming problem which can be solved efficiently by standard optimization software, so the computational advantage is not sacrificed for gaining robustness and reliability. We demonstrate the usefulness of the proposed approach through simulated robot-control tasks.

  • Geodesic Gaussian kernels for value function approximation

    Masashi Sugiyama, Hirotaka Hachiya, Christopher Towell, Sethu Vijayakumar

    AUTONOMOUS ROBOTS ( SPRINGER )  25 ( 3 ) 287 - 304   2008.10  [Refereed]

     View Summary

    The least-squares policy iteration approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular and useful choice as a basis function. However, it does not allow for discontinuity which typically arises in real-world reinforcement learning tasks. In this paper, we propose a new basis function based on geodesic Gaussian kernels, which exploits the non-linear manifold structure induced by the Markov decision processes. The usefulness of the proposed method is successfully demonstrated in simulated robot arm control and Khepera robot navigation.

    DOI

  • Adaptive importance sampling with automatic model selection in value function approximation

    Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters

    Proceedings of the National Conference on Artificial Intelligence   3   1351 - 1356   2008  [Refereed]

     View Summary

    Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are usually prohibitively expensive. A common approach is to use importance sampling techniques for compensating for the bias caused by the difference between data-sampling policies and the target policy. However, existing off-policy methods do not often take the variance of value function estimators explicitly into account and therefore their performance tends to be unstable. To cope with this problem, we propose using an adaptive importance sampling technique which allows us to actively control the trade-off between bias and variance. We further provide a method for optimally determining the trade-off parameter based on a variant of cross-validation. We demonstrate the usefulness of the proposed approach through simulations. Copyright © 2008.

  • Value function approximation on non-linear manifolds for robot motor control

    Masashi Sugiyama, Hirotaka Hachiya, Christopher Towell, Sethu Vijayakumar

    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10 ( IEEE )    1733 - +   2007  [Refereed]

     View Summary

    The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular and useful choice as a basis function. However, it does not allow for discontinuity which typically arises in real-world reinforcement learning tasks. In this paper, we propose a new basis function based on geodesic Gaussian kernels, which exploits the non-linear manifold structure induced by the Markov decision processes. The usefulness of the proposed method is successfully demonstrated in a simulated robot arm control and Khepera robot navigation.

▼display all

Books etc

  • ゼロからつくるPython機械学習プログラミング入門

    八谷大岳( Part: Sole author)

    講談社  2020.08 

  • 強くなるロボティック・ゲームプレイヤーの作り方 プレミアムブック版 : 実践で学ぶ強化学習

    八谷, 大岳, 杉山, 将

    マイナビ出版  2016  ISBN: 9784839956738

  • 強くなるロボティック・ゲームプレイヤーの作り方 : 実践で学ぶ強化学習

    八谷, 大岳, 杉山, 将

    毎日コミュニケーションズ  2008  ISBN: 9784839927417

Misc

  • Frequency-Dependent Image Reconstruction Error for Micro Defect Detection

    Yuhei Nomura, Hirotaka Hachiya (Part: Last author )

    Proceedings of The 15th Asian Conference on Machine Learning (ACML2023)     2023.11

  • 地下構造データに基づく解釈可能な深層inpaintingによる地震動補間

    八谷 大岳, 田羅鋤 祐果, 岩城 麻子, 前田 宜浩, 上田 修功, 藤原 広行

    日本地震工学シンポジウム     2023.11

  • ニューラルネットを用いた1kmメッシュ気温の推定

    北村智文,小林健二,若山郁生,今井崇人,丸山拓海, 上田修功,八谷大岳,高橋温志

    気象学会秋季大会     2023.10

  • 鏡面フローとハイライトに基づく深層特徴による鏡面検出

    吉村 優大, 八谷 大岳 (Part: Last author )

    第26回 画像の認識・理解シンポジウム(MIRU2023), ショートオーラル発表     2023.07  [Refereed]

  • Combining Static Specular Flow and Highlight with Deep Features for Specular Surface Detection

    Hirotaka Hachiya, Yuta Yoshimura (Part: Lead author )

    Proceedings of The 18th INternational Conference on Machine Vision Applications (MVA2023)     2023.07  [Refereed]

  • 周波数依存の画像再構成誤差に基づく極小欠陥検出

    野村侑平, 八谷大岳 (Part: Last author )

    第26回 画像の認識・理解シンポジウム(MIRU2023)     2023.07

  • Position-dependent partial convolutions for supervised spatial interpolation

    Hirotaka Hachiya, Kotaro Nagayoshi, Asako Iwaki, Takahiro Maeda, Naonori Ueda, Hiroyuki Fujiwara (Part: Lead author )

    Proceedings of The 14th Asian Conference on Machine Learning (ACML2022)   189   420 - 435   2022.12  [Refereed]

  • Study on Echocardiographic Image Segmentation Based on Attention U-Net

    Kai Wang, Jiwei Zhang, Hirotaka Hachiya, Haiyuan Wu

    2022 IEEE International Conference on Mechatronics and Automation (ICMA) ( IEEE )    2022.08  [Refereed]

    DOI

  • Attention-based set embedded vector for set-to-set matching

    中村 晟人, 八谷 大岳 (Part: Last author, Corresponding author )

    第25回 画像の認識・理解シンポジウム(MIRU2022)     2022.07

  • Transformer-Based Fully Trainable Model for Point Process with Past Sequence-Representative Vector

    Fumiya Nishizawa, Sujun Hong, Hirotaka Hachiya (Part: Last author, Corresponding author )

    IBISML2022-1   122 ( 90 )   2022.06

  • 局所面形状モデルを用いた球面鏡計測

    秋吉康平, 八谷大岳 (Part: Corresponding author )

    第22回 計測自動制御学会 システムインテグレーション部門講演会     2021.12

  • Multi-stream based Marked Point Process

    Sujun Hong, Hirotaka Hachiya (Part: Corresponding author )

    Proceedings of the 13th Asian Conference on Machine Learning (ACML2021)     2021.11  [Refereed]

  • 集合組み込みベクトルを用いたAttentionベースの順不変集合データマッチング

    中村晟人, 八谷大岳 (Part: Corresponding author )

    第24回情報論的学習理論ワークショップ     2021.11

  • Encoder-decoder-based image transformation approach for integrating precipitation forecasts

    Hirotaka Hachiya, Yusuke Masumoto, Yuki Mori, Naonori Ueda (Part: Lead author )

    Proceedings of the 13th Asian Conference on Machine Learning (ACML2021)     2021.11  [Refereed]

  • 全方位画像における正二十面体メッシュを用いた物体検出

    髙野誉将, 八谷大岳

    第24回 画像の認識・理解シンポジウム(MIRU2021)     2021.07

  • Mark-encoded image-based point process

    Sujun Hong, Hirotaka Hachiya

    第24回 画像の認識・理解シンポジウム(MIRU2021)     2021.07

  • Attention-based classification and segmentation for automatic thyroid nodule recognition and diagnosis

    戚意強, 鈴木祈史, 八谷大岳, 呉海元

    第24回 画像の認識・理解シンポジウム(MIRU2021)     2021.07

  • Encoder-decoder based image transformation approach for integrating precipitation forecasts

    Hirotaka Hachiya, Yusuke Masumoto, Yuki Mori, Naonori Ueda (Part: Lead author )

    第24回 画像の認識・理解シンポジウム(MIRU2021)     2021.07

  • U-Netを用いた時空間的な降 水量ガイダンス統合

    八谷 大岳, 増本悠介, 森 祐貴, 上田 修功 (Part: Lead author )

    日本気象学会2021年度春季大会     2021.05

  • マルチクラスAUC最大化を用いた台風発達予報

    黒良峻平, 八谷大岳, 嶋田宇大, 上田修功

    第23回情報論的学習理論ワークショップ     2020.11

  • Deep Inpaintingと空間分布マッチングの組み合わせによる地震動データの空間補完

    永吉耕太郎, 八谷大岳, 藤原広行, 上田修功, 岩城麻子, 前田宜浩

    第23回情報論的学習理論ワークショップ     2020.11

  • オートエンコーダを用いた時系列解析のための高自由度な面的点過程モデル

    洪秀俊, 八谷大岳

    第23回情報論的学習理論ワークショップ     2020.11

  • Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning

    Yuki Saito, Takuma Nakamura, Hirotaka Hachiya, Kenji Fukumizu

    第23回 画像の認識・理解シンポジウム(MIRU2020)     2020.08  [Refereed]

  • Label-CycleGANを用いたドメイン適応のためのCG実写変換

    永吉耕太郎, 八谷大岳

    第23回 画像の認識・理解シンポジウム(MIRU2020)     2020.08

  • Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning

    Yuki Saito, Takuma Nakamura, Hirotaka Hachiya, Kenji Fukumizu

    Proceedings of the 16th European Conference on Computer Vision (ECCV2020) ( Springer International Publishing )    626 - 646   2020.08  [Refereed]

    DOI

  • Direct Multi-class AUC Maximization for Forecasting Rapidly Intensifying Tropical Cyclones

    Hirotaka Hachiya, Shumpei Kurora, Udai Shimada, Naonori Ueda (Part: Lead author )

    JpGU-AGU Joint Meeting 2020     2020.07  [Refereed]

  • マルチクロスヒンジ損失を用いた不均衡多クラス分類

    黒良峻平, 八谷大岳, 嶋田宇大, 上田修功

    情報論的学習理論と機械学習研究会(IBISML) ( 電子情報通信学会 )    2020.03

  • 機械学習を用いた南海トラフ巨大地震シミュレータの摩擦パラメータ推定

    山本友, 平原和朗,八谷大岳, 上田修功

    固体地球科学データ同化に関する研究会     2020.02

  • Adaptive truncated residuals regression for fine-grained regression problems

    Hirotaka Hachiya, Yu Yamamamoto, Kazuhiro Hirahara, Naonori Ueda (Part: Lead author )

    Proceedings of the 11th Asian Conference on Machine Learning (ACML2019) ( ACML )    2019.11  [Refereed]

  • info-cycleGANを用いたドメイン適応のためのCG実写変換

    永吉耕太郎, 八谷大岳

    第22回情報論的学習理論ワークショップ ( 電子情報通信学会 )    2019.11

  • ニューラルネットワークを用いた急発達台風予報

    黒良峻平、八 谷大岳、嶋田宇大、上田修功

    日本気象学会2019年度秋季大会 ( 気象学会 )    2019.10

  • 機械学習を用いた異なるパラメータの相対強度マップの統合方法の検討

    八谷 大岳, 平原 和朗, 上田 修功

    地震学会秋季大会 ( 地震学会 )    2019.09

  • 機械学習とアンサンブルカルマンフィルタのハイブリッド手法を用いた南海トラフ巨大地震シミュレータの摩擦パラメータ推定

    山本 友, 平原 和朗, 八谷 大岳, 高橋 温志, 上田 修功

    地震学会秋季大会 ( 地震学会 )    2019.09

  • Machine Learning Approach for Adaptive Integration of Multiple Relative Intensity Models toward Improved Earthquake Forecasts in Japan

    Hirotaka Hachiya, Kazuro Hirahara, Naonori Ueda

    International Union of Geodesy and Geophysics (IUGG2019) ( IUGG )    2019.07  [Refereed]

  • Triple GANs with adversarial disturbances for discriminative anomaly detection

    Hirotaka Hachiya

    情報論的学習理論と機械学習研究会(IBISML2019-4) ( 電子情報通信学会 )    21-26   2019.06

  • Broadband ground-motion synthesis using embeddig machine learning

    omohisa Okazaki, Hirotaka Hachiya, Naonori Ueda, Asako Iwaki, Takahiro Maeda, Hiroyuki Fujiwara

    International Union of Geodesy and Geophysics (IUGG2019)     2019.05  [Refereed]

  • 埋込み機械学習による 長周期波形からの広帯域地震動合成

    岡崎 智久、八谷 大岳、上田 修功、岩城 麻子、前田 宜浩、藤原 広行

    日本地球惑星科学連合2019年大会(JpGU2019)   2019   2019.05

  • Synthesis of Broadband Ground Motions Using Embedding and Neural Networks

    Tomohisa Okazaki, Hirotaka Hachiya, Naonori Ueda, Asako Iwaki, Takahiro Maeda, Hiroyuki Fujiwara

    European Geosciences Union (EGU) ( EGU )    2019.04  [Refereed]

  • Machine learning approach for constraining the plausible ranges of frictional parameters on the Philippine Sea plate reproducing the historical sequences of the Nankai megaquakes

    Hirotaka Hachiya, Yu Yamamamoto, Kazuhiro Hirahara, Atsushi Takahashi, Naonori Ueda

    European Geosciences Union (EGU) ( EGU )    2019.04  [Refereed]

  • ニューラルネットを用いた逆埋め込み関数の近似と、その文書データ分布の解釈への応用

    溝渕湧也, 八谷大岳

    ニューロコンピューティング研究会 ( 電子情報通信学会 )    vol. 118, no. 470, NC2018-54, pp. 61-66   2019.03

  • 信頼度重み付きクラスタリングによる2次元測距センサの距離推定の頑健化

    射手矢和真, 八谷大岳, 中村恭之

    第19回計測自動制御学会システムインテグレーション部門講演会 ( 計測自動制御学会 )    3C1-07   2018.12

  • 2.5D+Orientationアンカーによる物体の距離と向きの推定

    佐々木寛史, 八谷大岳

    第19回計測自動制御学会システムインテグレーション部門講演会 ( 計測自動制御学会 )    3D4-05   2018.12

  • Training Discriminative Model for Anomaly Detection through Generative Adversarial Network

    Hirotaka Hachiya

    第21回情報論的学習理論ワークショップ(IBIS2018) ( 電子情報通信学会 )    2018.11

  • Laser variational autoencoder for map construction and self-localization

    Shohei Wakita, Takayuki Nakamura, Hirotaka Hachiya

    IEEE International Conference on Systems, Man, and Cybernetics ( IEEE )  2018   1P1-G06   2018.10  [Refereed]

     View Summary

    In this paper, we propose a novel method ”laserVAE” for learning feature descriptors of scan data from a 2D LIDAR, which are suitable for self-localization of a mobile robot in an environment. Our laserVAE is an enhanced version of variational autoencoder, which is tuned up for managing step-edges in scan data. Through experiments in a real environment, we demonstrate the effectiveness of the proposed method.

    DOI

  • 機械学習を用いた広帯域地震動合成の試み

    岡﨑智久、八谷大岳、前田宜浩、岩城麻子、藤原広行、上田修功

    地震学会2018年度秋季大会 ( 地震学会 )  2018   2018.10

  • 2.5D Faster R-CNN for Distance Estimation

    Hirotaka Hachiya, Yuki Saito, Kazuma Iteya, Masaya Nomura, Takayuki Nakamura

    IEEE International Conference on Systems, Man, and Cybernetics ( IEEE )    2018.10  [Refereed]

  • オートエンコーダを用いた環境地図の特徴表現と自己位置推定

    脇田翔平、中村恭之、八谷大岳

    ロボティクス・メカトロニクス講演会 ( 日本機械学会ロボティクス・メカトロニクス部門 )  2018   1P1-G06(1)--1P1-G06(3)   2018.06

     View Summary

    In this paper, we propose a novel method ”laserVAE” for learning feature descriptors of scan data from a 2D LIDAR, which are suitable for self-localization of a mobile robot in an environment. Our laserVAE is an enhanced version of variational autoencoder, which is tuned up for managing step-edges in scan data. Through experiments in a real environment, we demonstrate the effectiveness of the proposed method.

    DOI

  • 3Dアンカーによる距離推定とロボットナビゲーションへの応用

    八谷大岳、斎藤侑輝、射手矢和真、野村雅也、中村恭之

    第23回ロボティクスシンポジア ( 計測自動制御学会、日本ロボット学会、日本機械学会 )    354--357   2018.03  [Refereed]

     View Summary

    単眼カメラ画像から特定物体の距離を推定するディープラーニング方法の提案とロボットナビゲーションへの応用

  • 自由領域制限による経路教示と経路計画のハイブリッド自律走行

    野村雅也、中村恭之 、 八谷大岳

    第23回ロボティクスシンポジア ( 計測自動制御学会、日本ロボット学会、日本機械学会 )    145--147   2018.03  [Refereed]

     View Summary

    屋外環境でロボットをロバストに自律走行させるための、経路教示と経路計画を状況により切り替えるハイブリッド自律走行方法を提案

  • 透視投影アンカーを用いた特定物体の検出および距離推定

    八谷大岳、斎藤侑輝、射手矢和真、中村恭之

    第18回システムインテグレーション部門講演会 ( 計測自動制御学会 )    1952--1954   2017.12

     View Summary

    単眼カメラ画像から特定物体の距離を推定するディープラーニング方法の提案

  • ディープラーニングによる特定人物検出と距離推定

    八谷大岳、野村雅也、脇田翔平、射手矢和真、中村恭之

    つくばチャレンジ2017参加レポート集     156   2017

     View Summary

    つくばチャレンジ2017の特定人物探索を題材に開発したディープラーニングを用いた単眼カメラ画像からの距離推定技術の解説

  • 経路教示と経路計画のハイブリッド自律走行

    八谷大岳、野村雅也、脇田翔平、射手矢和真、中村恭之

    つくばチャレンジ2017参加レポート集     155   2017

     View Summary

    つくばチャレンジ2017にて、1kmの自律走行(マイルストーン2)を達成したロボットナビゲーション技術の解説

  • NSH: Normality Sensitive Hashing for Anomaly Detection

    Hirotaka Hachiya, Masakazu Matsugu

    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) ( IEEE )    795 - 802   2013  [Refereed]

     View Summary

    Locality sensitive hashing (LSH) is a computationally efficient alternative to the distance based anomaly detection. The main advantages of LSH lie in constant detection time, low memory requirement, and simple implementation. However, since the metric of distance in LSHs does not consider the property of normal training data, a naive use of existing LSHs would not perform well. In this paper, we propose a new hashing scheme so that hash functions are selected dependently on the properties of the normal training data for reliable anomaly detection. The distance metric of the proposed method, called NSH (Normality Sensitive Hashing) is theoretically interpreted in terms of the region of normal training data and its effectiveness is demonstrated through experiments on real-world data. Our results are favorably comparable to state-of-the arts with the low-level features.

    DOI

  • Squared-loss Mutual Information Regularization

    Gang Niu, Wittawat Jitkrittum, Bo Dai, Hirotaka Hachiya, Masashi Sugiyama

    International Conference on Machine Learning (ICML2013) ( International Machine Learning Society (IMLS) )    10--18   2013  [Refereed]

  • Feature Selection via l_1-Penalized Squared-Loss Mutual Information

    JITKRITTUM Wittawat, HACHIYA Hirotaka, SUGIYAMA Masashi

    電子情報通信学会技術研究報告. IBISML, 情報論的学習理論と機械学習 = IEICE technical report. IBISML, Information-based induction sciences and machine learning ( The Institute of Electronics, Information and Communication Engineers )  111 ( 480 ) 139 - 146   2012.03

     View Summary

    Feature selection is a technique to screen out less important features. Many existing supervised feature selection algorithms use redundancy and relevancy as the main criteria to select features. However, feature interaction, potentially a key characteristic in real-world problems, has not received much attention. As an attempt to take feature interaction into account, we propose l_1-LSMI, an l_1-regularization based algorithm that maximizes a squared-loss variant of mutual information between selected features and outputs. Numerical results show that l_1-LSMI performs well in handling redundancy, detecting non-linear dependency, and considering feature interaction.

  • Efficient Sample Reuse in Policy Gradients with Parameter-based Exploration

    ZHAO Tingting, HACHIYA Hirotaka, SUGIYAMA Masashi

    電子情報通信学会技術研究報告. IBISML, 情報論的学習理論と機械学習 = IEICE technical report. IBISML, Information-based induction sciences and machine learning ( The Institute of Electronics, Information and Communication Engineers )  111 ( 480 ) 55 - 62   2012.03

     View Summary

    The policy gradient approach is a flexible and powerful reinforcement learning method particularly for problems with continuous actions such as robot control. A common challenge in this scenario is how to stabilize policy gradient estimates for reliable policy updates. In this paper, we combine the following three ideas and give a highly stable and practical policy gradient method: (a) the policy gradients with parameter based exploration, which is a recently proposed policy search method with high stability, (b) an importance sampling technique, which allows us to reuse previously gathered data in an unbiased way, and (c) an optimal baseline, which minimizes the variance of gradient estimates with their unbiasedness being maintained. For the proposed method, we give theoretical analysis of the variance of gradient estimates and show its usefulness through experiments.

  • Modified Newton Approach to Policy Search

    HACHIYA Hirotaka, MORIMURA Tetsuro, MAKINO Takaki, SUGIYAMA Masashi

    電子情報通信学会技術研究報告 : 信学技報 ( The Institute of Electronics, Information and Communication Engineers )  111 ( 275 ) 79 - 85   2011.11

     View Summary

    The natural policy gradient method was shown to be a useful approach to policy search in reinforcement learning. However, its potential weakness is that information on returns is not reflected in the metric of natural gradients, implying that it is not adaptive to data and thus less flexible. To overcome this, we propose to use Newton's method which uses the Hessian of the expected return as a metric. However, the naive implementation of Newton's method does not guarantee the Hessian to be negative definite, which causes instability on policy updates. To cope with this problem, we propose an adaptive scheme to keep the Hessian nonnegative. We demonstrate the effectiveness of our proposed method in standard reinforcement learning tasks.

  • Relative Density-Ratio Estimation for Robust Distribution Comparison

    YAMADA Makoto, SUZUKI Taiji, KANAMORI Takafumi, HACHIYA Hirotaka, SUGIYAMA Masashi

    電子情報通信学会技術研究報告 : 信学技報 ( The Institute of Electronics, Information and Communication Engineers )  111 ( 275 ) 25 - 32   2011.11

     View Summary

    Divergence estimators based on direct approximation of density-ratios without going through separate approximation of numerator and denominator densities have been successfully applied to machine learning tasks that involve distribution comparison such as outlier detection, transfer learning, and two-sample homogeneity test. However, since density-ratio functions often possess high fluctuation, divergence estimation is still a challenging task in practice. In this paper, we propose to use relative divergences for distribution comparison, which involves approximation of relative density-ratios. Since relative density-ratios are always smoother than corresponding ordinary density-ratios, our proposed method is favorable in terms of the non-parametric convergence speed. Furthermore, we show that the proposed divergence estimator has asymptotic variance independent of the model complexity under a parametric setup, implying that the proposed estimator hardly overfits even with complex models. Through experiments, we demonstrate the usefulness of the proposed approach.

  • Computationally Efficient Multi-Label Classification by Least-Squares Probabilistic Classifier

    NAM Hyunha, HACHIYA Hirotaka, SUGIYAMA Masashi

    電子情報通信学会技術研究報告 : 信学技報 ( The Institute of Electronics, Information and Communication Engineers )  111 ( 275 ) 213 - 216   2011.11

     View Summary

    Multi-label classification allows a sample to belong to multiple classes simultaneously, which is often the case in real-world applications such as audio tagging, image annotation, video search, and text mining. In such a multi-label scenario, taking into account correlation between multiple labels can boost the classification accuracy. However, this in turn makes classifier training more challenging because handling multiple labels tends to induce a high-dimensional optimization problem. In this paper, we propose a highly scalable multi-label classifier based on a computationally efficient classification algorithm called the least-squares probabilistic classifier. Through experiments, we show the usefulness of our proposed method.

  • Artist Agent A2: Stroke Painterly Rendering Based on Reinforcement Learning

    Ning Xie, Hirotaka Hachiya, Masashi Sugiyama

    研究報告コンピュータビジョンとイメージメディア(CVIM)   2011 ( 18 ) 1 - 7   2011.08

     View Summary

    Oriental ink painting, called Sumi-e, is one of the most appealing painting styles that has attracted artists around the world. The major challenges in computer-based Sumi-e simulation are to abstract complex scene information and draw smooth and natural brush strokes. To automatically find such strokes, we propose to model the brush as a reinforcement-learning (RL) agent, and learn desired brush-trajectories by maximizing the sum of rewards in the policy search framework. We also provide elaborate design of state space, action space, and a reward function tailored for a Sumi-e agent. The effectiveness of our proposed approach is demonstrated through simulated Sumi-e experiments.Oriental ink painting, called Sumi-e, is one of the most appealing painting styles that has attracted artists around the world. The major challenges in computer-based Sumi-e simulation are to abstract complex scene information and draw smooth and natural brush strokes. To automatically find such strokes, we propose to model the brush as a reinforcement-learning (RL) agent, and learn desired brush-trajectories by maximizing the sum of rewards in the policy search framework. We also provide elaborate design of state space, action space, and a reward function tailored for a Sumi-e agent. The effectiveness of our proposed approach is demonstrated through simulated Sumi-e experiments.

  • Artist Agent A^2 : Stroke Painterly Rendering Based on Reinforcement Learning

    XIE Ning, HACHIYA Hirotaka, SUGIYAMA Masashi

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  111 ( 194 ) 119 - 125   2011.08

     View Summary

    Oriental ink painting, called Sumi-e, is one of the most appealing painting styles that has attracted artists around the world. The major challenges in computer-based Sumi-e simulation are to abstract complex scene information and draw smooth and natural brush strokes. To automatically find such strokes, we propose to model the brush as a reinforcement-learning (RL) agent, and learn desired brush-trajectories by maximizing the sum of rewards in the policy search framework. We also provide elaborate design of state space, action space, and a reward function tailored for a Sumi-e agent. The effectiveness of our proposed approach is demonstrated through simulated Sumi-e experiments.

  • Artist Agent A^2 : Stroke Painterly Rendering Based on Reinforcement Learning

    XIE Ning, HACHIYA Hirotaka, SUGIYAMA Masashi

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  111 ( 193 ) 119 - 125   2011.08

     View Summary

    Oriental ink painting, called Sumi-e, is one of the most appealing painting styles that has attracted artists around the world. The major challenges in computer-based Sumi-e simulation are to abstract complex scene information and draw smooth and natural brush strokes. To automatically find such strokes, we propose to model the brush as a reinforcement-learning (RL) agent, and learn desired brush-trajectories by maximizing the sum of rewards in the policy search framework. We also provide elaborate design of state space, action space, and a reward function tailored for a Sumi-e agent. The effectiveness of our proposed approach is demonstrated through simulated Sumi-e experiments.

  • Information-Maximization Clustering : Analytic Solution and Model Selection

    SUGIYAMA Masashi, YAMADA Makoto, KIMURA Manabu, HACHIYA Hirotaka

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  110 ( 476 ) 69 - 76   2011.03

     View Summary

    A recently-proposed information-maximization clustering method (Gomes et al., NIPS2010) learns a kernel logistic regression classifier in an unsupervised manner so that mutual information between feature vectors and cluster assignments is maximized. A notable advantage of this approach is that it only involves continuous optimization of a logistic model, which is substantially easier than discrete optimization of cluster assignments. However, this method still suffers from two weaknesses: (i) manual tuning of kernel parameters is necessary, and (ii) finding a good local optimal solution is not straightforward due to the strong non-convexity of logistic-regression learning. In this paper, we first show that the kernel parameters can be systematically optimized by maximizing mutual information estimates. We then propose an alternative information-maximization clustering approach using a squared-loss variant of mutual information. This novel approach allows us to obtain clustering solutions analytically in a computationally very efficient way. Through experiments, we demonstrate the usefulness of the proposed approaches.

  • Return distribution estimation with dynamic programming

    MORIMURA Tetsuro, SUGIYAMA Masashi, KASHIMA Hisashi, HACHIYA Hirotaka, TANAKA Toshiyuki

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  110 ( 265 ) 283 - 290   2010.10

     View Summary

    In the standard reinforcement learning framework, the expectation of returns (i.e., the discounted sum of rewards) is estimated via the Bellman equation for decision making. Recently, we extended this framework and proposed methods for estimating the distribution of returns based on the distributional Bellman equation. Although these methods allow us to deal with arbitrary criteria for risk-sensitive decision making such as the value-at-risk, their theoretical properties such as convergence issues were left as an open research issue. In this paper, we prove that solving the distributional Bellman equation by dynamic programming is always convergent irrespective of initialization. We further derive the rate of convergence for the moment of the return distribution estimator. Finally, based on the obtained theoretical results, we propose an improved method for return distribution estimation and demonstrate its effectiveness through numerical experiments.

  • New Feature Selection Method for Reinforcement Learning : Conditional Mutual Information Reveals Implicit State-Reward Dependency

    HACHIYA Hirotaka, SUGIYAMA Masashi

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  110 ( 76 ) 137 - 144   2010.06

     View Summary

    Model-free reinforcement learning (RL) is a machine learning approach to decision making in unknown environment. However, real-world RL tasks often involve high-dimensional state space, and then standard RL methods do not perform well. In this paper, we propose a new feature selection framework for coping with high dimensionality. Our proposed framework adopts conditional mutual information between state and return sequences as a feature selection criterion, allowing the evaluation of implicit state-reward dependency. The conditional mutual information is approximated by a least-squares method, which results in a computationally efficient feature selection procedure. The usefulness of the proposed method is demonstrated on simulated mobile-robot navigation experiments.

  • Improving Model-based Reinforcement Learning with Multitask Learning (数理モデル化と問題解決(MPS) Vol.2009-MPS-76)

    SIMM JAAK, SUGIYAMA MASASHI, HACHIYA HIROTAKA

    情報処理学会研究報告 ( 情報処理学会 )  2009 ( 5 ) 1 - 8   2010.02

  • Conditional Density Estimation Based on Density Ratio Estimation

    SUGIYAMA MASASHI, TAKEUCHI ICHIRO, SUZUKI TAIJI, KANAMORI TAKAFUMI, HACHIYA HIROTAKA, OKANOHARA DAISUKE

    研究報告バイオ情報学(BIO)   2009 ( 4 ) 1 - 8   2009.12

     View Summary

    Estimating the conditional mean of an input-output relation is the goal of regression. However, regression analysis is not sufficiently informative if the conditional distribution has multi-modality, is highly asymmetric, or contains heteroscedastic noise. In such scenarios, estimating the conditional distribution itself would be more useful. In this paper, we propose a novel method of conditional density estimation that is suitable for multi-dimensional continuous variables. The basic idea of the proposed method is to express the conditional density in terms of the density ratio and the ratio is directly estimated without going through density estimation.Estimating the conditional mean of an input-output relation is the goal of regression. However, regression analysis is not sufficiently informative if the conditional distribution has multi-modality, is highly asymmetric, or contains heteroscedastic noise. In such scenarios, estimating the conditional distribution itself would be more useful. In this paper, we propose a novel method of conditional density estimation that is suitable for multi-dimensional continuous variables. The basic idea of the proposed method is to express the conditional density in terms of the density ratio and the ratio is directly estimated without going through density estimation.

  • Improving Model-based Reinforcement Learning with Multitask Learning

    SIMM JAAK, SUGIYAMA MASASHI, HACHIYA HIROTAKA

    研究報告バイオ情報学(BIO) ( 情報処理学会 )  2009 ( 3 ) 1 - 8   2009.12

     View Summary

    We introduce an extension to standard reinforcement learning setting called observational RL (ORL) where additional observational information is available to the agent. This allows the agent to learn the system dynamics with fewer data samples, which is an essential feature for practical applications of RL methods. We show that ORL can be formulated as a multitask learning problem. A similarity-based and a component-based multitask learning methods are proposed for learning the transition probabilities of the ORL problem. The effectiveness of the proposed methods is evaluated in experiments of grid world and object lifting tasks.We introduce an extension to standard reinforcement learning setting called observational RL (ORL) where additional observational information is available to the agent. This allows the agent to learn the system dynamics with fewer data samples, which is an essential feature for practical applications of RL methods. We show that ORL can be formulated as a multitask learning problem. A similarity-based and a component-based multitask learning methods are proposed for learning the transition probabilities of the ORL problem. The effectiveness of the proposed methods is evaluated in experiments of grid world and object lifting tasks.

  • Statistical Active Learning for Efficient Value Function Approximation in Reinforcement Learning

    AKIYAMA Takayuki, HACHIYA Hirotaka, SUGIYAMA Masashi

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  108 ( 480 ) 261 - 266   2009.03

     View Summary

    Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares policy iteration (LSPI) framework allows us to employ statistical active learning methods for linear regression. Then we propose a design method of good sampling policies for efficient exploration, which is particularly useful when the sampling cost of immediate rewards is high. The proposed method combined with LSPI is called active policy iteration (API). Through simulations we demonstrate the usefulness of API.

  • Adaptive Importance Sampling with Automatic Model Selection in Reward Weighted Regression

    HACHIYA Hirotaka, PETERS Jan, SUGIYAMA Masashi

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  108 ( 480 ) 249 - 254   2009.03

     View Summary

    Direct policy search is a promising reinforcement learning framework in particular for controlling in continuous, high-dimensional systems such as anthropomorphic robot. Policy search often requires a large number of samples for obtaining a stable policy update estimator due to its high flexibility. However, this is prohibitive when the sampling cost is expensive. In this paper, we extend an EM-based policy search method so that previously collected samples can be efficiently reused. The usefulness of the proposed method, called Reward-weighted Regression with sample Reuse (R^3), is demonstrated through a toy example.

  • Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

    HACHIYA Hirotaka, AKIYAMA Takayuki, SUGIYAMA Masashi

    IEICE technical report ( The Institute of Electronics, Information and Communication Engineers )  107 ( 410 ) 75 - 80   2007.12

     View Summary

    Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past. A common approach is to use importance sampling techniques for compensating for the bias caused by the difference between data-collecting policies and the target policy. However, existing off-policy methods do not often take the variance of value function estimators explicitly into account and therefore their performance tends to be unstable. To cope with this problem, we propose using an adaptive importance sampling technique which allows us to actively control the trade-off between bias and variance. We further provide a method for optimally determining the trade-off parameter based on a statistical machine learning theory.

  • Robot Control by Least-Squares Policy Iteration with Geodesic Gaussian Kernels

    Hachiya Hirotaka, Sugiyama Masashi

    人工知能学会全国大会論文集 ( 人工知能学会 )  21   1 - 4   2007

▼display all

Works

  • ロボカップジュニア・ジャパンオープン2018

    2018.03

     View Summary

    2017.3.31-2018.4.1,主催:ロボカップジュニア・ジャパンオープン2018和歌山大会開催委員会,規模:全国大会,競技会開催(審判、運営協力者),ポスター審査員

  • つくばチャレンジ2017

    2017.11

     View Summary

    2017.11.10,主催:つくばチャレンジ実行委員会、つくば市,規模:53チーム・65台のロボットが出場,受賞等:自律走行のマイルストーン2(1km)の達成,全国大会

Conference Activities & Talks

  • Position-dependent inpainting for ground motion interpolation

    Hirotaka Hachiya

    Minisymposium, 10th International Congress on Industrial and Applied Mathematics  2023.08  

  • 強震動データベースの構築と最新技術を用いたデータベースの活用

    八谷大岳  [Invited]

    強震動データベースの構築と最新技術を用いたデータベースの活用  2023.02  

  • Transformer-based fully trainable model for poin;process with virtual sequence vectors;its experimental evaluation

    Fumiya Nishizawa, Sujun Hong, Hirotaka Hachiya

    第25回情報論的学習理論ワークショップ  2022.11  

  • 集合組み込みベクトルを用いたAttentionベースの順不変集合データマッチング

    中村晟人, 八谷大岳

    第24回情報論的学習理論ワークショップ  2021.11  

  • マルチクラスAUC最大化を用いた台風発達予報

    黒良 峻平, 八谷 大岳, 嶋田 宇大, 上田 修功

    第23回情報論的学習理論ワークショップ  2020.11  

  • Deep Inpaintingと空間分布マッチングの組み合わせによる地震動データの空間補完

    永吉 耕太郎, 八谷 大岳, 藤原 広行, 上田 修功, 岩城 麻子, 前田 宜浩

    第23回情報論的学習理論ワークショップ  2020.11  

  • オートエンコーダを用いた時系列解析のための高自由度な面的点過程モデル

    洪 秀俊, 八谷 大岳

    第23回情報論的学習理論ワークショップ  2020.11  

  • 機械学習を用いた南海トラフ巨大地震シミュレータの摩擦パラメータ推定

    山本友, 平原和朗,八谷大岳, 上田修功

    固体地球科学データ同化に関する研究会  2020.02  

  • info-cycleGANを用いたドメイン適応のためのCG実写変換

    永吉耕太郎, 八谷大岳

    第22回情報論的学習理論ワークショップ  2019.11   電子情報通信学会

  • 経路教示と経路計画のハイブリッド自律走行

    八谷大岳、野村雅也、脇田翔平、射手矢和真、中村恭之

    つくばチャレンジ2017参加レポート集  2017  

     View Summary

    つくばチャレンジ2017にて、1kmの自律走行(マイルストーン2)を達成したロボットナビゲーション技術の解説

  • ディープラーニングによる特定人物検出と距離推定

    八谷大岳、野村雅也、脇田翔平、射手矢和真、中村恭之

    つくばチャレンジ2017参加レポート集  2017  

     View Summary

    つくばチャレンジ2017の特定人物探索を題材に開発したディープラーニングを用いた単眼カメラ画像からの距離推定技術の解説

  • Robot Control by Least-Squares Policy Iteration with Geodesic Gaussian Kernels

    Hachiya Hirotaka, Sugiyama Masashi

    人工知能学会全国大会論文集  2007   人工知能学会

▼display all

Patents

  • Information processing apparatus, information processing method, and non-transitory computer-readable storage medium

    Patent no: 11468290

    Date registered: 2022.10.11  US

    Date applied: 2017.06.28 ( 15/635,302 )  

    Inventor(s)/Creator(s): Hirotaka Hachiya  Applicant: Canon Inc.

  • Surveillance apparatus, surveillance method, and storage medium

    Patent no: 11363241

    Date registered: 2022.06.14  US

    Date applied: 2019.11.05 ( 16/675,090 )  

    Inventor(s)/Creator(s): Shunsuke Sato , Hirotaka Hachiya , Yusuke Mitarai  Applicant: Canon Inc.

  • 情報処理装置、情報処理方法、及びプログラム

    Patent no: 特許第6976731号

    Date registered: 2021.12.08 

    Date applied: 2017.06.13 ( 特願2017-115995 )  

    Inventor(s)/Creator(s): 塚本 健二 , 八谷 大岳 , 森 克彦  Applicant: キヤノン株式会社

  • 情報処理装置、情報処理方法

    Patent no: 特許第6948851号

    Date registered: 2021.10.13 

    Date applied: 2017.06.16 ( 特願2017-118841 )  

    Inventor(s)/Creator(s): 八谷 大岳  Applicant: キヤノン株式会社

  • 情報処理装置、情報処理方法、及びプログラ

    Patent no: 特許第6945999号

    Date registered: 2021.10.06 

    Date applied: 2016.12.22 ( 特願2016-249292 )  

    Inventor(s)/Creator(s): 八谷 大岳  Applicant: キヤノン株式会社

  • 認識学習装置、認識学習方法及びプログラム

    Patent no: 特許第6900190号

    Date registered: 2021.06.18 

    Date applied: 2016.12.28 ( 特願2016-256060 )  

    Inventor(s)/Creator(s): 八谷 大岳 , 真継 優和  Applicant: キヤノン株式会社

  • 監視装置、監視方法、コンピュータプログラム、及び記憶媒体

    Patent no: 特許第6766009号

    Date registered: 2020.09.18 

    Date applied: 2017.05.09 ( 特願2017-093337 )  

    Inventor(s)/Creator(s): 佐藤 俊介 , 八谷 大岳 , 御手洗 裕輔 

  • 情報処理装置、情報処理方法、プログラム

    Patent no: 特許第6590477号

    Date registered: 2019.09.27 

    Date applied: 2014.11.28 ( 特願2014-242462 )  

    Inventor(s)/Creator(s): 八谷 大岳  Applicant: キヤノン株式会社

  • 異常検知方法、異常検知装置、及びプログラム

    Patent no: 特許6第525542号

    Date registered: 2019.05.17 

    Date applied: 2014.10.17 ( 特願2014-213182 )  

    Inventor(s)/Creator(s): 塚本 健二 , 八谷 大岳 , 森 克彦  Applicant: キヤノン株式会社

  • Recognition training apparatus, recognition training method, and storage medium

    Patent no: 10217027

    Date registered: 2019.02.26  US

    Date applied: 2017.06.13 ( 15/406,391 )  

    Inventor(s)/Creator(s): Hirotaka Hachiya , Masakazu Matsugu  Applicant: Canon Inc.

  • 識別装置及びデータ関係生成装置

    Patent no: 特許第6478650号

    Date registered: 2019.02.15 

    Date applied: 2015.01.16 ( 特願2015-006901 )  

    Inventor(s)/Creator(s): 八谷 大岳  Applicant: キヤノン株式会社

  • Information processing apparatus and information processing method

    Patent no: 10013628

    Date registered: 2018.06.03 

    Date applied: 2015.03.19 ( 14/662,488 )  

    Inventor(s)/Creator(s): Masami Kato , Hirotaka Hachiya  Applicant: Canon Inc.

  • ハッシュ値生成装置、システム、判定方法、プログラム、記憶媒体

    Patent no: 特許第6164899号

    Date registered: 2017.06.30 

    Date applied: 2013.04.05 ( 特願2013-079445 )  

    Inventor(s)/Creator(s): 八谷 大岳  Applicant: キヤノン株式会社

▼display all

Research Exchange

  • Tech Connect KANSAI

    2019.01
     
  • 和歌山大学/鳥取大学合同ビジネス連携交流会

    2018.10
     
  • 第3回 和歌山大学・和歌山県工業技術センター研究者交流会

    2018.03
     
  • 工学研究シーズ合同発表会

    2017.10
     

KAKENHI

  • Attention機構に基づく異種集合マッチング方式の分析と新方式の提案

    2023.04
    -
    2026.03
     

    Grant-in-Aid for Scientific Research(C)  Principal investigator

  • 強震動予測・地震ハザード解析における不確かさの定量評価に向けた研究

    2023.04
    -
    2026.03
     

    Grant-in-Aid for Scientific Research(A)  Co-investigator

  • 深層学習と統計モデリングの融合による自然現象予報のための画像変換方法の検討

    2020.04
    -
    2023.03
     

    Grant-in-Aid for Scientific Research(C)  Principal investigator

  • 観測データと理論データの融合に基づくデータ駆動型強振動予測モデルの開発

    2020.04
    -
    2023.03
     

    Grant-in-Aid for Scientific Research(A)  Co-investigator

  • 環境変動にロバストなディープニューラルネットのための学習データ生成方法の研究

    2017.10
    -
    2019.03
     

    Grant-in-Aid for Research Activity Start-up  Principal investigator

Instructor for open lecture, peer review for academic journal, media appearances, etc.

  • わかやま地域情報化フォーラム

    2024.01.23

    和歌山県情報化推進協議会

     View Details

    地方自治体、生成AI

    トークセッション出演

  • 技術アドバイザー

    2023.07.01
    -
    2024.06.30

    株式会社ZOZO NEXT

     View Details

    助言・指導

    ZOZO Researchに関する技術アドバイザー業務

  • 講演講師

    2022.12.16

    一般社団法人 電子情報通信学会 関西支部

     View Details

    和歌山高専の学生

    12月16日に和歌山工業高等専門学校にて開催する「学生のための講演会」において、「機械学習」について講演する。

  • 技術相談役

    2022.04.01
    -
    2022.06.30

    株式会社QIS

     View Details

    助言・指導

    データ分析、画像認識に関する指導・アドバイス・技術相談および調査

  • 査読

    2021.10

    計測自動制御学会

     View Details

    学術雑誌等の編集委員・査読・審査員等

    計測自動制御学会論文集の査読

  • 技術アドバイザー

    2020.04.01
    -
    Now

    株式会社 サイバーリンクス

     View Details

    助言・指導

    機械学習コンサルティングの助言及び指導

  • 客員研究員

    2020.04.01
    -
    Now

    国立研究開発法人理化学研究所

     View Details

    客員研究員

    各種センサーデータの解析及び防災・減災シミュレータの研究開発

  • Reviewer

    2020.02
    -
    2020.04

    International Conference on Machine Learning

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Reviewer

  • 講師

    2019.11

    和歌山県立向陽高等学校・中学校 和元年度向陽SSH中高合同ゼミ(実験講座)

     View Details

    講演講師等

    講師,任期:2019年11月~

  • 高度IT研修

    2019.05

    日比谷コンピュータシステム

     View Details

    公開講座・講演会の企画・講師等

    機械学習の概要、応用例およびPythonを用いた機械学習の実装演習,日付:2019.5.29

  • 講師

    2019.05

    株式会社HCSホールディングス

     View Details

    講演講師等

    講師,任期:2019年5月~

  • SSH中高合同ゼミ

    2019.04

    その他

     View Details

    小・中・高校生を対象とした学部体験入学・出張講座等

    和歌山県立向陽高校にて、機械学習入門の出張講義を実施,日付:2019年11月8日

  • AI技術講演会

    2019.03

    和歌山県工業技術センター

     View Details

    公開講座・講演会の企画・講師等

    最先端の機械学習とその応用,日付:2019.3.5

  • 講師

    2019.03

    和歌山県工業技術センター

     View Details

    講演講師等

    講師,任期:2019年3月~

  • 若手研究者研究成果発表会

    2018.12

    和歌山情報サービス産業協会、わかやま産業振興財団

     View Details

    公開講座・講演会の企画・講師等

    ディープラーニングを用いたセンサ ーデータの圧縮と変換,日付:2018.12.13

  • Reviewer

    2018.06
    -
    2018.07

    Conference on Neural Information Processing Systems

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Reviewer

  • Reviewer

    2018.04
    -
    2018.05

    IEEE International Conference on System, Man, and Cybernetics

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Reviewer

  • Program Committee

    2018.01
    -
    2018.04

    International Conference on Machine Learning

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Program Committee

  • 第26回わかやまテクノビジネスフェア

    2017.11

    公益財団法人わかやま産業振興財団

     View Details

    公開講座・講演会の企画・講師等

    機械学習の研究に関する講演およびポスター発表,日付:2017.11.10

  • Reviewer

    2017.10
    -
    2017.11

    IEEE International Conference on Robotics and Automation

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Reviewer

  • Program Committee

    2017.09
    -
    2017.10

    Thirty-Second AAAI Conference on Artificial Intelligence

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Program Committee

  • カンボジア王立プノンペン大学の学生・教員の和歌山大学訪問

    2017.08

    大阪府立大学現代システム科学・さくらサイエンスプラン

     View Details

    公開講座・講演会の企画・講師等

    機械学習の研究に関する講演,日付:2017.8.10

  • Reviewer

    2017.06
    -
    2017.07

    Conference on Neural Information Processing Systems

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Reviewer

  • Associate Editor

    2017.02
    -
    2017.05

    IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2017)

     View Details

    学術雑誌等の編集委員・査読・審査員等

    Associate Editor

▼display all

Committee member history in academic associations, government agencies, municipalities, etc.

  • プログラム委員会幹事補

    2022.06.01
    -
    2023.03.31
     

    第28回ロボティクスシンポジア

     View Details

    ロボット、システム、メカトロニクス、ロボティクス

    プログラム委員会の作業全般

  • 和歌山県商工観光労働部所管公募型プロポーザル方式等事業者選定委員会委員

    2022.05.16
    -
    2023.06.30
     

    和歌山県

     View Details

    デジタル技術講習

    和歌山県令和4年度デジタル技術講習(データ・クラウド)、(AI・IoT)企画運営業務の事業者選定に当たり、専門的知見を活かし候補者の提案内容を審査いただく。

  • わかやま地域活性化雇用創造プロジェクト事業審査委員会委員

    2021.05.03
    -
    2022.03.31
     

    公益財団法人わかやま産業振興財団

     View Details

    雇用創造 地域活性化

    わかやま地域活性化雇用創造プロジェクト事業費補助金「先端技術導入支援事業」の県内事業者からの申請書の審査

  • 顧問・機械学習コンサルティング

    2019.08
    -
    2020.03
     

    株式会社サイバーリンクス

     View Details

    国や地方自治体、他大学・研究機関等での委員

    顧問・機械学習コンサルティング,任期:2019年8月~2020年3月

  • 客員研究員

    2019.04
    -
    2020.03
     

    国立研究開発法人理化学研究所

     View Details

    国や地方自治体、他大学・研究機関等での委員

    客員研究員,任期:2019年4月~2020年3月

  • ロボカップジュニアポスター審査員

    2018.04
    -
    2019.03
     

    RoboCupJunior Japan Association

     View Details

    学協会、政府、自治体等の公的委員

    ロボカップジュニアポスター審査

  • 客員研究員

    2018.04
    -
    2019.03
     

    国立研究開発法人理化学研究所

     View Details

    国や地方自治体、他大学・研究機関等での委員

    客員研究員,任期:2018年4月~2019年3月

  • 客員研究員

    2018.04
    -
    2019.03
     

    理化学研究所

     View Details

    学協会、政府、自治体等の公的委員

    防災科学チームにおいて、各種センサーデータの解析および 防災・減災シミュレータの研究開発

  • 客員研究員

    2017.04
    -
    2018.03
     

    理化学研究所

     View Details

    学協会、政府、自治体等の公的委員

    防災科学チームにおいて、各種センサーデータの解析および 防災・減災シミュレータの研究開発

▼display all

Other Social Activities

  • 日本酒AI専門技術研究会 会長

    2020.04
    -
    2021.03

    日本酒AI専門技術研究会

     View Details

    産業界、行政諸機関等と行った共同研究、新技術創出、コンサルティング等

    和歌山県内の醸造メーカー、IT企業、および和歌山工業技術センターが参画する日本酒AI専門技術研究会を立ち上げた。AIなど情報技術の日本酒の製造での活用に向けて、県内外の専門家および研究者を招待し、勉強会を4回開催した。