{"id":303,"date":"2017-02-01T14:20:23","date_gmt":"2017-02-01T12:20:23","guid":{"rendered":"http:\/\/members.loria.fr\/ADeleforge\/?p=303"},"modified":"2019-12-20T13:40:37","modified_gmt":"2019-12-20T11:40:37","slug":"the-vast-project","status":"publish","type":"post","link":"https:\/\/members.loria.fr\/ADeleforge\/the-vast-project\/","title":{"rendered":"The VAST project"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-286 alignright\" src=\"http:\/\/members.loria.fr\/ADeleforge\/wp-content\/blogs.dir\/192\/files\/sites\/192\/2018\/08\/small-300x300.png\" alt=\"\" width=\"180\" height=\"180\" srcset=\"https:\/\/members.loria.fr\/ADeleforge\/wp-content\/blogs.dir\/192\/files\/sites\/192\/2018\/08\/small-300x300.png 300w, https:\/\/members.loria.fr\/ADeleforge\/wp-content\/blogs.dir\/192\/files\/sites\/192\/2018\/08\/small-150x150.png 150w, https:\/\/members.loria.fr\/ADeleforge\/wp-content\/blogs.dir\/192\/files\/sites\/192\/2018\/08\/small-60x60.png 60w, https:\/\/members.loria.fr\/ADeleforge\/wp-content\/blogs.dir\/192\/files\/sites\/192\/2018\/08\/small.png 520w\" sizes=\"auto, (max-width: 180px) 100vw, 180px\" \/><\/p>\n<p>VAST stands for\u00a0<strong>virtual acoustic space traveling <\/strong>and is a new paradigm for learning-based\u00a0<strong>sound source localization <\/strong>and<strong> audio scene geometry estimation.<\/strong> Most existing methods that estimate the position of a sound source or other geometrical properties of an audio scene are based either on an approximate physical model (<em>physics-driven<\/em>) or on a specific-purpose calibration set (<em>data-driven<\/em>). With VAST, the idea is to learn a mapping from audio features to the desired geometrical properties using a\u00a0<strong>massive dataset of simulated room impulse responses<\/strong>. The dataset is designed to be maximally representative of the audio scenes in which the system under consideration may operate, while remaining reasonably compact. 
The aim is to demonstrate\u00a0that mappings learned on virtual\u00a0datasets generalize well to real-world data, and to provide a useful tool for research teams interested in sound source localization.<\/p>\n<p>&nbsp;<\/p>\n<p>Cl\u00e9ment Gaultier, Saurabh Kataria, Diego Di Carlo and I are working on the release of datasets for VAST. Two binaural datasets are already available on the project website. We co-authored two publications demonstrating this paradigm for binaural 3D sound source localization and wall absorption estimation using these datasets.<\/p>\n<ul>\n<li><strong>Website:<\/strong>\u00a0<a href=\"http:\/\/theVASTproject.inria.fr\">http:\/\/theVASTproject.inria.fr<\/a><\/li>\n<li><strong>References:<\/strong>\n<ul>\n<li><a href=\"https:\/\/hal.archives-ouvertes.fr\/hal-01416508\" target=\"_blank\" rel=\"noopener noreferrer\">VAST : The Virtual Acoustic Space Traveler Dataset<\/a>, Cl\u00e9ment Gaultier, Saurabh Kataria, Antoine Deleforge,\u00a0<i>International Conference on Latent Variable Analysis and Signal Separation (LVA\/ICA)<\/i>, Feb 2017, Grenoble, France.\u00a0<a href=\"https:\/\/hal.archives-ouvertes.fr\/hal-01416508\/file\/main_lva2017_gaultier.pdf\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" title=\"https:\/\/hal.archives-ouvertes.fr\/hal-01416508\/file\/main_lva2017_gaultier.pdf\" src=\"https:\/\/haltools.inria.fr\/images\/Haltools_pdf.png\" alt=\"https:\/\/hal.archives-ouvertes.fr\/hal-01416508\/file\/main_lva2017_gaultier.pdf\" border=\"0\" \/><\/a> <a href=\"https:\/\/hal.archives-ouvertes.fr\/hal-01416508\/bibtex\" target=\"_self\" rel=\"noopener noreferrer\"> <img decoding=\"async\" title=\"BibTex\" src=\"https:\/\/haltools.inria.fr\/images\/Haltools_bibtex3.png\" alt=\"BibTex\" border=\"0\" \/><\/a><\/li>\n<li><a href=\"https:\/\/hal.inria.fr\/hal-01372435v2\" target=\"_blank\" rel=\"noopener noreferrer\">Hearing in a shoe-box : binaural source position and wall absorption estimation 
using virtually supervised learning<\/a>, Saurabh Kataria, Cl\u00e9ment Gaultier, Antoine Deleforge,\u00a0<i>IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)<\/i>, Mar 2017, New-Orleans, United States.\u00a0<a href=\"https:\/\/hal.inria.fr\/hal-01372435\/file\/main_revised.pdf\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" title=\"https:\/\/hal.inria.fr\/hal-01372435\/file\/main_revised.pdf\" src=\"https:\/\/haltools.inria.fr\/images\/Haltools_pdf.png\" alt=\"https:\/\/hal.inria.fr\/hal-01372435\/file\/main_revised.pdf\" border=\"0\" \/><\/a>\u00a0<span class=\"LienBibtexACoteFulltext\"><a href=\"https:\/\/hal.inria.fr\/hal-01372435v2\/bibtex\" target=\"_self\" rel=\"noopener noreferrer\"><img decoding=\"async\" title=\"BibTex\" src=\"https:\/\/haltools.inria.fr\/images\/Haltools_bibtex3.png\" alt=\"BibTex\" border=\"0\" \/><\/a><\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>VAST stands for\u00a0virtual acoustic space traveling and is a new paradigm for learning-based\u00a0sound source localization and audio scene geometry estimation. Most existing methods that estimate the position of a sound source or other audio geometrical properties are either based on an approximate physical model (physics-driven) or on a specific-purpose calibration set (data-driven). 
With VAST, the [&hellip;]<\/p>\n","protected":false},"author":176,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8,1],"tags":[],"class_list":["post-303","post","type-post","status-publish","format-standard","hentry","category-datasets","category-non-classe"],"_links":{"self":[{"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/posts\/303","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/users\/176"}],"replies":[{"embeddable":true,"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/comments?post=303"}],"version-history":[{"count":3,"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/posts\/303\/revisions"}],"predecessor-version":[{"id":349,"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/posts\/303\/revisions\/349"}],"wp:attachment":[{"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/media?parent=303"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/categories?post=303"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/members.loria.fr\/ADeleforge\/wp-json\/wp\/v2\/tags?post=303"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}