The name SOUNDBOT is a concatenation of two radical taken from the words SOUND and roBOT. The SOUNDBOT to refer to a open standard domestic sounds and noise robot to provide reliable robotic services interoperation, and integration with human and other smart home and/or remote monitoring services.
The SOUNDBOT experiment is envisaged as a mobile robot with ears (audio-sensors) and natural hearing and listening sense (Unit for acoustic processing) to detect acoustics events, recognise activities and contextual cases to provide interaction for human collaboration and object cooperation services. SOUNDBOT acts as human listening and capable of performing a variety of complex human tasks on command via voice/graphical user interfaces or programmatically via the SOUNDBOT platform.
Discussion of the ANSWER to the pre-proposal checkform
Distribution of competences and WPs
12:00 12:15 Introductions, available material and proposed actions
12:15 12:30 SOUNDITCS (FET-OPEN) and FLAGSHIP
12:30 12:45 SOUNDS FOR ENERGY EFFICIENT BUILDINGS + SOUNDS FOR EFFICIENT CITIES
12:45 13:00 Round of questions
Acoustic Computing for Artificial Intelligent and Ambient Intelligent Applications
ACAIA.org April-2010
April 2010 issue
Latest News: After the discussions in fresh ideas and future visions we already have submitted and received this week the feedback of the FET officers for preparing the STREP and work in the Flagship.We can meet on 9-10 June in BXL (we will present ACAIA idea)
A robot that cannot hear is as a person that is deaf.
Problems that deaf people have are so much, for instance the traffic light that shows green, but the ambulance using his horn that comes from behind and should be given way. The deaf do not know that the ambulance is coming. Sound can help to make a better decision about priority in this case. The deaf person detects the green light and walks. The hearing person detects the green light AND the horn and doesn’t walk.
SoundBOT ICT6 2.1c IP Awaiting the results of the evaluation on June
ACAIA Network Session proposal at ICT2010 on September in BXL
awaiting response
SoundITCs FET-OPEN STREP in preparation for submission on September
SoundBot. ICT6 3.2.c) Cognition and Robotics: Awaiting the results of the SoundBOT IP proposal. The first feedbacks received from our NCPs seems that our success should depend on the consideration of the term "complete robot" and the embodying Audition into Robots as a new interesting integration.
SoundITCs. ICT models of Audition in Robotics: We have started the preparation of the STREP (two stages Jun/Sep) . Partners have to send their potential contributions to the different WPs as subsystems/models that we have sent. If you dont have received yet please send us your LoI. We are preparing a second pre-proposal for sending. After that we could meet at the Flagship worshop on 9-10Jun10 in BXL. We could discuss the short proposal evaluation and the details of the final proposal. In addition we can discuss opportunities of ACAIA community and AUDITION for ROBOTICS topic to join some of this raising Flaships.
S4EeB. Launching the resubmission and Building up a new consortium for a new proposal for Smart Cities:Looking at the presentation of Peter AIT in the Infoday of BXL and the rest of attendants we have to prepare an high quality proposal. In other hand there is opportunity to send a new proposal for the topic of Smart Cities All interested partners in participate has to send a LoI including potential contributions (Open Jul and close Nov).
SoundSec. Starting the study of the draft and the feedbacks of ideas:Seems that we have some chair into some topics of the draft of the call to be confirmed on July. We follow working for elaborate the proposal idea for starting the discussions (Open Jul and close Nov).
ICT2010. Proposal for Network Session:Sent the proposal now we have to await if it is approved. This will be a perfect opportunity to organise the second Conference in Acoustiuc Computing for AI/AmI Applications on 27-29 September 2010 in BXL. Before SoundSec and S4EeB proposals.
ACAIA promising visionary idea is to exploit the information associate to sounds for all Artificial Intelligent system and the benefit of the Knowledge Society.
ACAIA Identifies a unifying scientific goal, Audition for all Robots
ACAIA problem is that the justification of the installation of audio-sensors networks for acquiring sounds is the auditory capacity, or the power to extract the useful associated information and the ability to understand it. The complete audition faculties is reached with the multimodal reasoning that embodies consciousness to the robot and facilitates building natural and human robots.
ACAIA solution is distributing and sharing development and experience on Internet to be available for all robots from semantic repositories of Acoustic Processing, Multi-sensory Cognition and Robotics Reasoning, forming and populating two meshes of Artificial Sensing, Thinking and Acting and systems learning, teaching, training and factoring that allow the personalisation, adaptation ande evolution of Artificial Intelligence.
ACAIA exploring and modelling concepts and technological of Acoustics processing, Artificial Audition, Cognition and Robotics that can contribute to Multi-Modal sensing and reasoning and Consciousness of robotics challenges of long term importance for Europe.
Acoustic Computing for Artificial Intelligence and Ambient Intelligent Applications
SoundITCs aims to investigate ICT models for endowing auditory capacities and embodying Audition faculties into robotics systems and proof-of-concept of acoustic and cognition processing and robotic systems to leverage the rich though untapped potential of the information associated to sounds, by simulation of human audition perception and understanding and multi-sensory reasoning and populate the capabilities on Internet to provide personality, adaptation and evolution to Artificial Intelligent.
The main strand of breakthroughs are: A1 ACQUISITION Audio-sensors and audio-sensors networks A2 PERCEPTION Acoustic Events Detection (AED). Representation, classification and compression A3 RECOGNITION Analysis of sets of AED for Activity Recognition A4 REFLECTION Multi-sensory analysis (audio/video mainly) A5 DECISION. risk-management consideration and decision-making of actuations. A6 ACTUATION, Four types of robotic services: Informing, Alerting, Assisting and Monitoring. A7 PERSONALISATION Behaviour of interactions (configuration of applications via parametrisation) A8 ADAPTATION. Robotics user-training, such a domestic pet. A9 EVOLUTION. Continuous process of automatic upgrading from Internet and user-training
Sounds for efficient Cities (S4eC)
EEB-ICT-2011 ICT for energy-positive neighbourhoods
Sounds for Efficient Cities aims to contribute globally to the European Energy-Efficient Buildings Initiative by developing more accurate and intelligent management and control systems and decision support systems monitoring via domotics and urbiotics in quasi real-time the types of occupancy and activities for addressing the dynamics of energy supply and demand in neighbourhoods and extended urban/rural communities.
The Digital Control Centre (DCC) systems of Sounds for Efficient Cities will optimise the use of energy beyond the public buildings and the street lighting and optimise the balance for provision and consumption and the integration of renewable energy sources and the connection to the energy grid in order to take advantage of variable tariffs and diversity of supply.
The technical developments and implementations of the audio-sensor networks (ASN) in urbiotics and domotics systems will address the necessary convincing and reliable business models of microphones, audio-processing cards and energy DCC devices, splitting incentives based on applications and services on “The future of Internet of Things Sounds and Energy Services”, engage end users with low cost multifunctional “set tool box” and the commitment of public authorities to the deployment of such systems for saving expenses and pay-back in short time the investment.
Interoperation of the Sounds for Efficient Cities systems with other ICT systems, such as security and traffic management systems in urbiotics sensing and Inclusions and Health care services in domotics systems that may be deployed in the area will be considered an asset.
In addition to Sounds for Efficient Cities systems will include a substantial experimental validation and benchmarking phase. During this phase, Sounds for Efficient Cities will demonstrate in real situations and operative cases and measure the qualitative and quantitative benefits and total cost of operation for use by those planning to deploy and finance such systems and draw lessons which can be replicated.
Robowatch Technologies GmbH Pankstr.8 -10 Haus C D -13127 Berlín Alemania Tel.+49 30 4749 8860 Fax +49 30 4749 8866 info@robowatch.de www.robowatch.de
MOBILE SURVEILLANCE AND RECONNAISSANCE SYSTEMS <!-- Text: [begin] -->
Robowatch Technologies Ltd. is a leading robotics company based in Berlin, Germany. We develop and distribute mobile security robots, which relieve humans of dangerous tasks, warn, and protect people in hazardous situations.
The robots are used, for instance, to support security personnel in the surveillance of perilous zones or very large areas. They are also used to optimise the reconnaissance activities of action forces in civil defence and disaster prevention.
Our products offer reliable protection everywhere, where the monitoring of an object by humans is dangerously, unreasonably or not eligible for financing. With this development projection and constantly new technical solutions we stand at the point of the industry.
'Developing cognitive robots whose "purpose in life" would be to serve humans as assistants or "companions". Such robots would be able to learn new skills and tasks in an active open-ended way and to grow in constant interaction and co-operation with humans. (excerpt of the European Commission's 'Beyond Robotics' workprogramme
The development of a robot, which would serve humans in their daily life, poses several important challenges to Robotics and Artificial Intelligence.
As a mechatronic device, the robot should of course be endowed with mobility and manipulation capacities in human environments. But the robot being in permanent interaction with humans in a changing context and varying situations, it cannot be only considered as a ready-made device.
The dynamics of the tasks it has to achieve, of the situations, and of the interactions with humans, imply that such a robot companion will have to exhibit cognitive capacities for adapting its behaviour in changing situations and for various tasks.
It will have to understand its spatial surroundings, to make decisions according to its own evaluation of its situation, and to interact closely with humans both physically and through multimodal dialogue.
Such capacities cannot be pre-programmed, but will have to be learned, and open-ended learning processes will therefore be at the core of every function, all the more as the robot companion must also adapt to its human partner and learn new skills to be able to respond to the humans' needs.
To see the project's vision in a video click here.
The overall objectives of this project are to study the perceptual, representational, reasoning and learning capabilities of embodied robots in human centred environments. The project develops methods and technologies for the construction of such cognitive robots, able to evolve and grow their capacities in close interaction with humans in an open-ended fashion.
Expected results are basic methods, algorithms and architectures and their integration and long-term experimentation and scientific evaluation on embodied robotic systems in different settings and situations.
In the focus of this research endeavour is the development of a robot whose ultimate task is to serve humans as a companion in their daily life. The robot is not only considered as a ready-made device but as an artificial creature, which improves its capabilities in a continuous process of acquiring new knowledge and skills.
Besides the necessary functions for sensing, moving and acting, such a robot will exhibit the cognitive capacities enabling it to focus its attention, to understand the spatial and dynamic structure of its environment and to interact with it, to exhibit a social behaviour and communicate with other agents and with humans at the appropriate level of abstraction according to context.
The design of the cognitive functions of this artificial creature and the study and development of the continuous learning, training and education process in the course of which it will mature to a true companion, are the central research themes of the project.
Human-robot collaboration requires both communicative and decision making skills of a robot. To enable flexible coordination and turn-taking between human users and a robot in joint tasks, the robot's dialog and decision making mechanism have to be synchronized in a meaningful way. In this paper, we propose a integration framework to combine the dialog and the decision making processes. With this framework, we investigate various task negotiation situations for a social robot in a fetch-and-carry scenario. For the technical realization of the framework, the interface specification between the dialog and the decision making systems is also presented. Further, we discuss several challenging issues identified in our integration effort that should be adddressed in the future.
In studies of human-robot communication we have observed that the robot's inability to give appropriate communicative feedback for contact and perception causes miscommunication. The purpose of this paper is to motivate the need for and give an initial characterisation of a model for communicative contact and perception feedback. One component of a such a model could be a kind of low-level user model aiming to decide the perceptual status of the user. We have analysed an example of interaction that shows signs of miscommunication that is related to feedback problems concerning contact and perception. Using the analysis to pinpoint the source of these problems we provide an initial account for the type of information sources that a low- level user that handles communicative contact and perception feedback should comprise. We also provide two design examples that in our view motivates this explorative effort.
In this paper we provide a possible characterisation of user behaviour based on an analysis of a corpus of human-robot communication, collected by using the Wizard-of-Oz technique to elicit communicative behaviour. We distinguish between three general types of user behaviour: uni- form user behaviour, idiosyncratic user behaviour and distinguishing user behaviour. We also present an analysis of user behaviour that can be characterized in terms of overall task organi- sation (i.e., interaction episodes) and behaviour that is intimately connected to communicative behaviour. We also discuss to what extent manipulation of objects to prepare the environment can be used to group users along the dimensions: task- vs. interaction-orientation and control vs. monitoring. Using this typology we discuss categorisation into four dimensions of use: Directors, Manipulators, Pointers and Players. To support these use dimensions we propose a set of adaptation foci (Focus on Feedback or Action, and on Proactive or Reactive behaviour).
The goal of the current work was to develop an interaction management system for a mobile robot companion. In comparison to many desktop computer applications, such a robot poses a number of new scientific questions that need to be addressed by its interaction management system. More specifically, the interaction management system should fulfill the following 8 requirements of Human-Robot Interaction (HRI): (1) handle cooperative interaction, (2) en- able mixed-initiative interaction style, (3) separate interaction from domain task execution, (4) account for multi-modality of embodied interaction, (5) facilitate recognition of interaction ini- tiated by users, (6) make use of different modalities in a meaningful way, (7) enable social behaviors, and (8) contribute to the usability of the entire robot system. In the current work, a powerful computational model of multi-modal grounding, the MMPDA model, was proposed. This model improves the existing grounding models and naturally supports multi-modal em- bodied communication in HRI. For the robot BIRON, the MMPDA model was implemented within two Implementation-Evaluation-Cycles (IEC), in which users played an important role in the determination of implementation foci. In the course of the IECs, various functions and behaviors were realized and their benefits were also proven in the user studies of each cycle. The MMPDA model and the implemented system completely fulfill the above eight requirements of HRI.
Verbal and non-verbal interaction capabilities for robots are often studied isolated from each other in current research trend because they largely contribute to different aspects of interaction. For a robot companion that needs to be both useful and social, however, these capabilities have to be considered in a unified, complex interaction context. In this paper we present two case studies in such a context that clearly reveal the strengths and limitations of these modalities and advocate their complementary benefits for human-robot interaction. Motivated by this evidence we propose a powerful interaction framework which addresses common features of interactional and propositional information instead of their differences, as popular in many other works in this field, and models them using one single principle: grounding.
AAAI Spring Symposium 2007 on Interaction Challenges for Intelligent Assistants (best paper award)
Manja Lohse, Katharina Rohlfing, Britta Wrede, Gerhard Sagerer
This paper investigates the influence of feedback provided by an autonomous robot (BIRON) on users' discursive behavior. A user study is described during which users show objects to the robot. The results of the experiment indicate, that the robot's verbal feedback utterances cause the humans to adapt their own way of speaking. The changes in users' verbal behavior are due to their beliefs about the robots knowledge and abilities. In this paper they are identified and grouped. Moreover, the data implies variations in user behavior regarding gestures. Unlike speech, the robot was not able to give feedback with gestures. Due to the lack of feedback, users did not seem to have a consistent mental representation of the robot's abilities to recognize gestures. As a result, changes between different gestures are interpreted to be unconscious variations accompanying speech.
Manja Lohse, Marc Hanheide, Britta Wrede, Michael L. Walters, Kheng Lee Koay, Dag Sverre Syrdal, Anders Green, Helge Huttenrauch, Kerstin Dautenhahn, Gerhard Sagerer, and Kerstin Severinson-Eklundh
Human-robot interaction (HRI) as research field is emerging with new social robots that have to be able to interact with inexperienced users. In the design of these robots many research findings of human-human interaction and human-computer interaction are applied. Nevertheless, the direct applicability of these theories is limited since a robot is different from a human as well as from a computer. Therefore, new user centred methods have to be applied in HRI in order to build robots that are suitable for inexperienced users. In this paper we present a video study we conducted with our robot BIRON (BIelefeld RObot companioN), which is build to be used in a domestic environment. Subjects watch the system during the interaction with a human and rate two different robot behaviours (extrovert; introvert). The behaviours differ regarding verbal output und person following of the robot. The goal is to compare the ratings of the behaviours and to improve human-robot interaction with the help of the results.
Thorsten P. Spexard, Shuyin Li, Britta Wrede, Marc Hanheide, Elin A. Topp, Helge Huttenrauch
An important goal for research on service robots is the cooperation of a human and a robot as team. A service robot in a domestic environment needs to build a representation of its future workspace that corresponds to the human user's understanding of these surroundings. But it also needs to apply this model about the "where" and "what" in its current interaction to allow communication about objects and places in a human-adequate way. In this paper we present a first integration of a hierarchical robotic mapping system into an interactive framework con- trolled by a dialog system. The goal is to use interactively acquired environment models to implement a robot with interaction aware behaviors. A major contribution of this work is a three-level hierarchy of spatial representation affecting three different communication dimen- sions. This hierarchy is consequently applied in the design of the grounding-based dialog, laser-based topological mapping, and an objects attention system. We demonstrate the benefits of this integration for learning and tour guiding in a human comprehensible interaction between a robot and its user in a home-tour scenario. The enhanced interaction capabilities are crucial for developing a new generation of robots that will be accepted not only as service robots but also as robot companions.
This article presents a key-scenario of H/R interaction for our tourguide robot. Given this scenario, three visual modalities, the robot deals with, have been outlined, namely the "search of visitors" attending the exhibition, the "proximal interaction" through the robot interface and the "guidance mission". The paper focuses on the two last ones which involves face recognition and visual data fusion in a particle filtering framework. Evaluations on key-sequences in a human centred environment show the tracker robustness to background clutter, sporadic occlusions and group of persons. The tracker is able to cope with target loss by detecting and re-initializing automatically thanks to the face recognition outcome. Moreover, the multi-cues association proved to be more robust to clutter than any of the cues individually.
This paper deals with visual recognition and tracking of people and gestures from a camera mounted on a tour-guide robot in a human, cluttered, environment. The particle filtering framework enables the fusion of visual cues, both into an importance function from which the particles are sampled, and into a measurement model used for weights definition. The multi-cues associations prove to be more robust than any of the cues individually. For the purpose of gestures recognition, a tracker is proposed which handles multiple hand configurations templates. Finally, implementation and experiments on a tour-guide robot are presented in order to highlight the relevance and complementarity of the developed visual functions. Extensions are finally discussed.
The MUSIC algorithm (MUltiple SIgnal Classification) is a well-known high-resolution method to sound source localization. However, as it is essentially narrowband, several extensions can be envisaged when dealing with broadband sources like human voice. This paper presents such extensions and proposes a comparative study w.r.t. specific robotics constraints. An online beamspace MUSIC method, together with a recently developed beamforming scheme, are shown to constitute a mathematically sound and potentially efficient solution.
This paper addresses the problem of people detection using 2D range data and omnidirectional vision. The range data is often used in robotics to detect person legs. First, we will develop a reliable 2D range data leg detector using the AdaBoost algorithm. Second, instead of using the leg detector to detect people, we propose a more reliable method that takes into account the spatial arrangement of the detected legs. The method is inspired by the latest results on the "partbased representations" from the computer vision area. Finally, we show how the part-based representations can be used to combine the leg detections from the 2D range data and upper body, lower body and full body detections from omnidirectional camera. The experimental results show highly reliable people detection when the two sensors are combined. We use Haarlike features and the AdaBoost again to construct the omnicam body parts detectors.
IEEE/RSJ International Conference on Intelligent Robots and Systems. (2007)
We present a foveated vision system for robust person detection in wide field of view. The system consists of an omnidirectional camera for people detection in a wide field of view and a pan-tilt camera that can focus on a particular location. Combining the information from both cameras leads to improved people detection. The people detection is based on human body part detectors and a probabilistic model of the spatial arrangement of the parts. The representation is robust to partial occlusions part detector false alarms and missed detections of body parts. We also show how to use the fact that the persons walk on a known ground plane to increase the efficiency and reliability of the detection.
IEEE International Conference on Computer Vision Systems. (2007)
The "layered image model" represents an image sequence as a composition of 2D layers where each layer corresponds to a different object. A layer is described by its appearance and its transparency mask. The transparency masks are used to combine the layers. In this paper we present a probabilistic layered model that uses the "logistic principal component analysis (PCA)" to describe the masks. The Gaussian based factor analysis was used previously but it does not consider the constraints imposed on the transparency values. The "logistic PCA" models the transparency values that are between 0 and 1 more naturally using Bernoulli distributions. The presented model can be used to automatically extract low dimensional representation of the transparency maps of the moving objects from a video sequences more efficiently.
Mathias Fontmarty, Frédéric Lerasle and Patrick Danès
This paper presents a new algorithm for human motion three-dimensional tracking based on a stereo camera system embedded on a mobile robot. The approach mixes advantages of the well-known ICONDENSATION and Annealed particle filters into a more reliable "I-Annealed" particle filter based tracker. Data fusion is also studied to show that a wide variety of visual cues must be used so that the system can adapt to various backgrounds. A complete implementation of the proposed tracker is described as well as some results on indoor sequences. Finally, evolutions and future work are discussed.
M. Lösch, S. R. Schmidt-Rohr, S. Knoop, R. Dillmann
Human activity recognition is an essential ability for service robots and other robotic systems which interact with human beings. To be proactive, the system must be able to evaluate the current state of the user it is dealing with. Future surveillance systems will also benefit from robust activity recognition if real time constraints are met, allowing to automate tasks that still have to be fulfilled by humans. In this paper, a new approach for the feature selection in human activity recognition is proposed. Typically, the features are chosen with respect to the relevance of the features for the classification of the activity which shall be recognized. Our new approach extends this process by involving background knowledge about the features and active user engagement. By building a taxonomy over the complete feature set, a user can interactively guide and refine the selection process and thereby avoid certain problems which are common if noisy or small amounts of training data are used to train the system.
Proceedings of the International Conference on Cognitive Systems (CogSys 2008).
Zhe Li, Sven Wachsmuth, Jannik Fritsch, and Gerhard Sagerer
This paper puts forward an approach for a mobile robot to recognize the human's manipulative actions from different single camera views. While most of the related work in action recognition assume a fixed static camera view that is the same for training and testing, such kind of constraints do not apply for mobile robot companions. We propose a recognition scheme that is able to generalize an action model, that has been learned from a very few data items observed from a single camera view, to variant view points and different settings. We tackle the problem of compensating the view dependence of 2D motion models on three different levels. Firstly, we pre-segment the trajectories based on an object vicinity that depends on the camera tilt and object detections. Secondly, an interactive feature vector is designed that represents the relative movements between the human hand and the objects. Thirdly, we propose an adaptive HMM-based matching process that is based on a particle filter and includes a dynamically adjusted scaling parameter that models the systematic error of the view dependency. Finally, we use a two-layered approach for task recognition which decouples the task knowledge from the view dependent primitive recognition. The results of experiments in an office environment show the applicability of this approach
Joe Saunders, Nuno Otero and Chrystopher L. Nehaniv
Teaching a robot new skills may require that the teacher scaffolds the teaching experience appropriately. However, due to inherent assumptions made by a human teacher the scaffolding process may in some circumstances fail to effectively teach the robot. Here we illustrate this issue in two simple robot teaching exploratory studies and examine the assumptions made by the teacher when teaching the robot. In the first study the human teacher had to reason about robot perceived states in order to provide suitable teaching. In the second study the human teachers had to understand the perceptual constraints of the robot based on the instructions given beforehand by the experimenter and subsequently adapt the guidance given. The results suggest that although the two tasks are quite distinct in their level of complexity a common thread can be observed: people tend to underspecify their teaching. It seems that steps of the explanation are assumed to be known and skipped or not even considered at all. We reflect on the possibility that one of the major challenges in designing robots that are capable interaction partners in these teaching situations is to be able to make them communicate their internal state and current capabilities effectively. Furthermore, we also reflect on designing appropriate behavioral primitives for the robot, corresponding implications on the level of task description and for benefiting from human teaching.
This article discusses challenges of Human-Robot Interaction, which is a highly inter- and multidisciplinary area. Themes that are important in current research in this lively and growing field are identified and selected work relevant to these themes is discussed.
International Journal of Advanced Robotic Systems 4(1) pp.103-108
Social intelligence in robots has a quite recent histor y in artificial intelligence and robotics. However, it has become increasingly apparent that social and interactive skills are necessary requirements in many application areas and contexts where robots need to interact and collaborate with other robots or humans. Research on human-robot interaction (HRI) poses many challenges regarding the nature of interactivity and 'social behaviour' in robot and humans. The first par t of this paper addresses dimensions of HRI, discussing requirements on social skills for robots and introducing the conceptual space of HRI studies. In order to illustrate these concepts, two examples of HRI research are presented. First, research is sur veyed which investigates the development of a cognitive robot companion. The aim of this work is to develop social rules for robot behaviour (a 'robotiquette') that is comfortable and acceptable to humans. Second, robots are discussed as possible educational or therapeutic toys for children with autism. The concept of interactive emergence in human-child interactions is highlighted. Different types of play among children are discussed in the light of their potential investigation in human-robot experiments. The paper concludes by examining different paradigms regarding 'social relationships' of robots and people interacting with them.
Philosophical Transactions of the Royal Society B: Biological Sciences,362(1480),pp.679-704.
K.L. Koay, E.A. Sisbot, D.S. Syrdal, M.L. Walters, K. Dautenhahn and R. Alami
This paper presents the results from a Human-Robot Interaction study that investigates the issues of participants' preferences in terms of the robot approach direction (directionRAD), robot base approach interaction distance (distanceRBAID), robot handing over hand distance (distanceRHOHD), robot handing over arm gesture (gestureRHOAG), and the coordination of both the robot approaching and gestureRHOAG in the context of a robot handing over an object to a seated person. The results from this study aim at informing the development of a Human Aware Manipulation Planner. Twelve participants with some previous human-robot interaction experience were recruited for the trial. The results show that a majority of the participants prefer the robot to approach from the front and hand them a can of soft drink in the front sector of their personal zone. The robot handing over hand position had the most influence on determining from where the robot should approach (i.e distanceRAD). Legibility and perception of risk seem to be the deciding factor on how participants choose their preferred robot arm-base approach coordination for handing over a can.
In Proceedings of AAAI - SpringSymposium2007: SS07, Multidisciplinary Collaborationfor Socially Assistive Robotics botics , AAAI Technical Report, Stanford University, Palo Alto, Ca, USA. pp. 18 24 AAAI Press
Kheng Lee Koay, Dag Sverre Syrdal, Michael L. Walters and Kerstin Dautenhahn,
This paper presents and discusses a longitudinal study which investigated habituation effects between humans and robots over a period of five weeks. Participants' prefer- ences for the robot's approach distance with respect to its ap- proach direction and appearance were investigated in a variety of domestic scenarios. These human-robot interaction (HRI) scenarios were also designed to explore the notions of auton- omy and control. The results of this study show that participants' preferences change over time as the participants habituate to the robot. This trend was significant in terms of the robot's appearance and approach direction. Also, it seems to indicate that partici- pants who are accustomed to the robot prefer to be more 'in control' of the situation - in that they appreciated reduced ro- bot autonomy in case of unexpected events.
In Proceedings of the 16th IEEE International Workshop on Robot and HumanInteractiveCommunication (RO-MAN 2007), South Korea. pp. 564 569
Nuno Otero, Aris Alissandrakis, Kerstin Dautenhahn, Chrystopher Nehaniv, Dag Sverre Syrdal and Kheng Lee Koay
In this paper, we explore some conceptual issues, relevant for the design of robotic systems aimed at interacting with humans in domestic environments. More specifically, we study the role of the robot's feedback (positive or negative acknowledgment of understanding) on a human teacher's demonstration of a routine home task (laying a table). Both the human and the system's perspectives are considered in the analysis and discussion of results from a human-robot user study, highlighting some important conceptual and practical issues. These include the lack of explicitness and consistency on people's demonstration strategies. Furthermore, we discuss the need to investigate design strategies to elicit people's knowledge about the task and also successfully advertize the robot's abilities in order to promote people's ability to provide appropriate demonstrations.
Nuno Otero, Joe Saunders, Kerstin Dautenhahn, Chrystopher L. Nehaniv
For robots to be more capable interaction partners they will necessarily need to adapt to the needs and requirements of their human companions. One way that the human could aid this adaptation may be by teaching the robot new ways of doing things by physically demonstrating different behaviours and tasks such that the robot learns new skills by imitating the learnt behaviours in appropriate contexts. In human-human teaching the concept of scaffolding describes the process whereby the teacher guides the pupil to new competence levels by exploiting and extending existing competencies. In addition the idea of event structuring can be used to describe how the teacher highlights important moments in the overall interaction episode. Scaffolding and event structuring robot skills in this way may be an attractive route in achieving robot adaptation, however there are many ways in which a particular behaviour might be scaffolded or structured and the interaction process itself may have an effect on the robot's resulting performance. Our overall research goal is to understand how to design an appropriate human-robot interaction scenario where the robot will be able to intervene and elicit knowledge from the human teacher in order to better understand the taught behaviour. In this article we examine some of these issues in two exploratory human-robot teaching scenarios. The first considers task structuring from the robot's viewpoint by varying the way in which a robot is taught. The experimental results illustrate that the way in which teaching is carried out, and primarily how the teaching steps are decomposed, has a critical effect on the efficiency of human teaching and the effectiveness of robot learning. The second experiment studies the problem from the human viewpoint in an attempt to study the human teacher's spontaneous levels of event segmentation when analysing their own demonstrations of a routine home task to a robot. The results suggest the existence of some individual differences regarding the level of granularity spontaneously considered for the task segmentation and for those moments in the interaction which are viewed as most important.
In Connection Science, Vol. 00, No. 00, January 2008, 1-26
Emrah Akin Sisbot, Luis F. Marin-Urias, Rachid Alami, and Thierry Simeon,
Robot navigation in the presence of humans raises new issues for motion planning and control when the humans must be taken explicitly into account. We claim that a human aware motion planner (HAMP) must not only provide safe robot paths, but also synthesize good, socially acceptable and legible paths. This paper focuses on a motion planner that takes explicitly into account its human partners by reasoning about their accessibility, their vision field and their preferences in terms of relative human-robot placement and motions in realistic environments. This planner is part of a human-aware motion and manipulation planning and control system that we aim to develop in order to achieve motion and manipulation tasks in the presence or in synergy with humans.
Robots' interaction with humans raises new issues for geometrical reasoning where the humans must be taken explicitly into account. We claim that a human-aware motion system must not only elaborate safe robot motions, but also synthesize good, socially acceptable and legible movement. This paper focuses on a manipulation planner and a place- ment mechanism that take explicitly into account its human partners by reasoning about their accessibility, their vision field and their preferences. This planner is part of a human-aware motion and manipulation planning and control system that we aim to develop in order to achieve motion and manipulation tasks in presence or in synergy with humans.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems(IROS07), San Diego, USA.
Dag Sverre Syrdal, Michael L. Walters, Kheng Lee Koay, Sarah N. Woods, Kerstin Dautenhahn
Technical report of the AAAI Spring Symposiumon Multidisciplinary Collaboration for Socially Assistive Robotics, (AAAI SS07-2007), Stanford University, Palo Alto, CA, USA.
Dag Sverre Syrdal, Kheng Lee Koay, Michael L. Walters and Kerstin Dautenhahn
This study investigated the relationship between individual differences and proxemic behaviour in an HRI setting involving a robot approaching a person. In total 33 participants took part in three different scenarios; verbal interaction, physical interaction and no interaction. Participant control over the robot, and approach direction was also varied. Measurements of the preferred robot approach distance was obtained, and analysed along with the participants' demographic and personality data. The results indicate differences in approach direction preferences based on gender. Also, results show that the participants' personality traits of extraversion and conscientiousness are associated with changes in approach distance preferences according to robot autonomy. The results are discussed in light of relevant literature from the social sciences.
In Proceedingsof the 16th IEEE International Workshop on Robot and Human Interactive Communication (RO-MAN 2007), Korea. pp. 26 29
Michael L. Walters, Dag S. Syrdal, Kerstin Dautenhahn, René te Boekhorst, Kheng Lee Koay
This article presents the results of video-based Human Robot Interaction (HRI) trials which investigated people's perceptions of different robot appearances and as- sociated attention-seeking features and behaviors displayed by robots with different appearance and behaviors. The HRI trials studied the participants' preferences for various fea- tures of robot appearance and behavior, as well as their per- sonality attributions towards the robots compared to their own personalities. Overall, participants tended to prefer ro- bots with more human-like appearance and attributes. How- ever, systematic individual differences in the dynamic ap- pearance ratings are not consistent with a universal effect. Introverts and participants with lower emotional stability tended to prefer the mechanical looking appearance to a greater degree than other participants. It is also shown that it is possible to rate individual elements of a particular robot's behavior and then assess the contribution, or otherwise, of that element to the overall perception of the robot by people. Relating participants' dynamic appearance ratings of indi- vidual robots to independent static appearance ratings pro- vided evidence that could be taken to support a portion of the left hand side of Mori's theoretically proposed 'uncanny valle' diagram. Suggestions for future work are outlined.
Michael Walters, Kheng Lee Koay, Sarah Woods, Dag S. Syrdal, K. Dautenhahn
This paper presents results from Human Robot Interaction (HRI) trials carried out at the University of Hertfordshire which examined how a robot companion must behave when fetching anf carrying objects to and from human participants in a domestic 'living room' scenario. It was found that different social approach rules apply depending whether the interacting human is sitting, standing in the open, or against a wall or obstacle. The main purpose of this paper is to demonstrate the techniques for automatically annotating HRI trial videos with participants supplied comfort data. During the trial participants indicated via Comfort Level Device (CLD) their preferences for robot approach distances. This data was correlated with distance measurements obtained from the CLD annotated video recording of the trials. The development of a post-processing technique for overlaying a structured grid on to the video recordings of the trial, allowed a continuous record of both human and robot's positions and distances during the HRI trials to be recorded. Implications and suggestions for further HRI trials and an improvement of the methodology conclude the paper.
In Technical Report of the AAAI Spring Symposium on Multidisciplinary Collaboration for Socially Assistive Robotics, (AAAI SS07-2007), Stanford University, Palo Alto, CA, USA.
Michael Walters, Kerstin Dautenhahn, Sarah Woods, Kheng Lee Koay
This paper presents the collected results, outcomes and conclusions from a series of Human Robot Interaction (HRI) trials which addressed the question of how a robot should approach a human in a fetch and carry task. Two pilot trials were carried out, aiding the development of a main HRI trial with four different approach contexts and a larger subject sample size. The findings from the pilot trials were confirmed and expanded upon. Most subjects disliked a frontal approach when seated, except for a small minority of female subjects. In general, seated humans do not like to be approached by a robot directly from the front even when seated behind a table. A frontal approach is more acceptable when a human is standing in an open area. Most subjects preferred to be approached from either the left or right side, with a small overall preference for a right approach by the robot. However, this is not a strong preference and it may be disregarded if it is more physically convenient to approach from a left front direction. Handedness and occupation were not related to these preferences. Subjects do not usually like the robot to move or approach from directly behind them, preferring the robot to be in view even if this means the robot taking a physically non-optimum path. The subjects for the main HRI trials had no previous experience of interacting with robots. Future research aims are outlined and include the necessity of carrying out longitudinal trials to see if these findings hold over a longer period of exposure to robots.
In Proceedings of the 2nd ACM SIGCHI/SIGART Conference on Human-Robot Interaction (HRI 07), Washington DC, USA. pp. 317 324
M. L. Walters, K. Dautenhahn, R. te Boekhorst, K. L. Koay, S. N. Woods
This paper presents the results of video based Human Robot Interaction (HRI) trials which investigated people's perceptions of different robot appearances and associated attention seeking features and behaviors displayed by the robot. The methodological approach highlights the 'holistic' and embodied nature of robot appearance and behavior. Results show that people tend to rate a particular behavior less favorably when the behavior is not consistent with the robot's appearance. It is shown how participants' ratings of robot dynamic appearance are influenced by the robot's behavior. Relating participants' dynamic appearance ratings of individual robots to independently rated static appearance provides support for the left hand side of Mori's proposed ``uncanny valley'' diagram. We exemplify how to rate individual elements of a particular robot's behavior and then assess the contribution of those elements to the overall perception of the robot by people. Suggestions for future work are outlined.
In Proceedings of IEEE-Artificial Life (Alife 07), Honolulu, Hawaii, USA. pp. 341 347
We present an approach to teach incrementally human gestures to a humanoid robot. The learning process consists of first projecting the movement data in a latent space and encoding the resulting signals in a Gaussian Mixture Model (GMM). We compare the performance of two incremental training procedures against a batch training procedure. Qualitative and quantitative evaluations are performed on data acquired from motion sensors attached to a human demonstrator and data acquired by kinesthetically demonstrating the task to the robot. We present experiments to show that these different modalities can be used to teach incrementally basketball officials' signals to a HOAP-3 humanoid robot.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction (HRI), pp. 255-262, 2007.
Guenter, F., Hersch, M., Calinon, S. and Billard, A.
The goal of developing algorithms for programming robots by demonstration is to create an easy way of programming robots such that it can be accomplished by anyone. When a demonstrator teaches a task to a robot, he/she shows some ways of fulfilling the task, but not all the possibilities. The robot must then be able to reproduce the task even when unexpected perturbations occur. In this case, it has to learn a new solution. In this paper, we describe a system to teach to the robot constrained reaching tasks. Our system is based on a dynamical system generator modulated by a learned speed tra jectory. This system is combined with a reinforcement learning module to allow the robot to adapt the tra jectory when facing a new situation, for example in the presence of obstacles.
RSJ Advanced Robotics, Vol. 21, No. 13, pp. 1521-1544, 2007.
Teaching a robot new skills may require that the teacher scaffolds the teaching experience appropriately. However, due to inherent assumptions made by a human teacher the scaffolding process may in some circumstances fail to effectively teach the robot. Here we illustrate this issue in two simple robot teaching exploratory studies and examine the assumptions made by the teacher when teaching the robot. In the first study the human teacher had to reason about robot perceived states in order to provide suitable teaching. In the second study the human teachers had to understand the perceptual constraints of the robot based on the instructions given beforehand by the experimenter and subsequently adapt the guidance given. The results suggest that although the two tasks are quite distinct in their level of complexity a common thread can be observed: people tend to underspecify their teaching. It seems that steps of the explanation are assumed to be known and skipped or not even considered at all. We reflect on the possibility that one of the major challenges in designing robots that are capable interaction partners in these teaching situations is to be able to make them communicate their internal state and current capabilities effectively. Furthermore, we also reflect on designing appropriate behavioral primitives for the robot, corresponding implications on the level of task description and for benefiting from human teaching.
In Proc. 16th IEEE International Symposium on Robot and Human Interactive Communication (IEEE Ro-Man 2007), pages 708 713. IEEE, Jeju Island, Korea.
Otero, N., Saunders, J., Nehaniv, C. L., and Dautenhahn, K.
For robots to be more capable interaction partners they will necessarily need to adapt to the needs and requirements of their human companions. One way that the human could aid this adaptation may be by teaching the robot new ways of doing things by physically demonstrating different behaviours and tasks such that the robot learns new skills by imitating the learnt behaviours in appropriate contexts. In human-human teaching the concept of scaffolding describes the process whereby the teacher guides the pupil to new competence levels by exploiting and extending existing competencies. In addition the idea of event structuring can be used to describe how the teacher highlights important moments in the overall interaction episode. Scaffolding and event structuring robot skills in this way may be an attractive route in achieving robot adaptation, however there are many ways in which a particular behaviour might be scaffolded or structured and the interaction process itself may have an effect on the robot’s resulting performance. Our overall research goal is to understand how to design an appropriate human-robot interaction scenario where the robot will be able to intervene and elicit knowledge from the human teacher in order to better understand the taught behaviour. In this article we examine some of these issues in two exploratory human-robot teaching scenarios. The first considers task structuring from the robot’s viewpoint by varying the way in which a robot is taught. The experimental results illustrate that the way in which teaching is carried out, and primarily how the teaching steps are decomposed, has a critical effect on the efficiency of human teaching and the effectiveness of robot learning. The second experiment studies the problem from the human viewpoint in an attempt to study the human teacher’s spontaneous levels of event segmentation when analysing their own demonstrations of a routine home task to a robot. The results suggest the existence of some individual differences regarding the level of granularity spontaneously considered for the task segmentation and for those moments in the interaction which are viewed as most important.
Otero, N., Alissandrakis, A., Dautenhahn, K., Nehaniv, C. L., Srydal, D., and Koay, K. L.
In this paper, we explore some conceptual issues, relevant for the design of robotic systems aimed at interacting with humans in domestic environments. More specifically, we study the role of the robot's feedback (positive or negative acknowledgment of understanding) on a human teacher's demonstration of a routine home task (laying a table). Both the human and the system's perspectives are considered in the analysis and discussion of results from a human-robot user study, highlighting some important conceptual and practical issues. These include the lack of explicitness and consistency on people's demonstration strategies. Furthermore, we discuss the need to investigate design strategies to elicit people's knowledge about the task and also successfully advertize the robot's abilities in order to promote people's ability to provide appropriate demonstrations.
In Proceedings of the ACM Conference on Human Robot Interaction (ACM HRI 2008). Amsterdam, Netherlands.
Learning is a core feature of future household robot systems. Nonetheless, present-day learning approaches fail to take into account that learning is never a finished process but an everyday task for biological systems. Additionally, humans always learn a various number of different tasks at the same time. This paper proposes an approach to these two problems by applying the concept of Piagetian learning to the problem of robot task learning. It proposes a method for the autonomous recognition of different task classes in the robots experiences and gives one possibility, how this task knowledge can be exploited for incremental learning of sequential reordering features of a task. This framework is evaluated using three different tasks from the household domain.
In Proc. 6th IEEE International Conference on Development and Learning, London, UK, July 2007.
Robots are expected to move from industrial environments to the household domain. Programming those robots can not be done in conventional ways since the wide variety of tasks the user needs to be accomplished can not be foreseen by the robot manufacturer. One solution to this problem is the Programming by Demonstrion (PbD) paradigm, where the user himself programs the robot by demonstrating the task to be performed. In this setting, the robot observes the user acting and infers the program that fits the users intention. One problem not adressed by current PbD systems is how to infer loop structures that are necessary to perform repetitive tasks from the demonstrations. This paper proposes a theoretical framework for efficient multi-hypotheses tracking for hierarchical robot programs covering generic loop statements called Version Space Algebra (VSA). A version space design is implemented for and evaluated with tasks from the household domain, i.e. the tasks of unloading a dishwasher basket and laying the table for multiple persons.
In Proc. 13th IASTED International Conference on Robotics and Applications, Wrzburg, Germany, August 2007.
Michael Pardowitz, Steffen Knoop, Ruediger Dillmann, Member, IEEE, and Raoul D. Zöllner
Since many years the robotics community is envisioning robot assistants sharing the same environment with humans. It became obvious that they have to interact with humans and should adapt to individual user needs. Especially the high variety of tasks robot assistants will be facing requires a highly adaptive and user-friendly programming interface. One possible solution to this programming problem is the learning-by-demonstration paradigm, where the robot is supposed to observe the execution of a task, acquire task knowledge, and reproduce it. In this paper, a system to record, interpret, and reason over demonstrations of household tasks is presented. The focus is on the model-based representation of manipulation tasks, which serves as a basis for incremental reasoning over the acquired task knowledge. The aim of the reasoning is to condense and interconnect the data, resulting in more general task knowledge. A measure for the assessment of information content of task features is introduced. This measure for the relevance of certain features relies both on general background knowledge as well as task-specific knowledge gathered from the user demonstrations. Beside the autonomous information estimation of features, speech comments during the execution, pointing out the relevance of features are considered as well. The results of the incremental growth of the task knowledge when more task demonstrations become available and their fusion with relevance information gained from speech comments is demonstrated within the task of laying a table.
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART B: CYBERNETICS, VOL. 37, NO. 2, APRIL 2007
This paper presents an measure for image similarity based on local feature descriptions and geometric constraints. We show that on the basis of this similarity an appearance graph representation of the environment of a mobile robot can be made. This graph can be used for representing semantic information about the space, and can be used for visual navigation. The image similarity measure is robust for occlusions by people in the neighbourhood of the robot.
In Proceedings of the 3rd European Conference on Mobile Robots, pages 168-174, 2007.
Service robots designed for domestic settings need to navigate in an environment that they have to share with their users. Thus, they have to be able to report their current state and whereabouts in a way that is comprehensible for the user. Pure metric maps do not usually correspond to the understanding of the environment a user would provide. Thus, the robotic map needs to be integrated with the human representation. With our framework for Human Augmented Mapping we aim to deal with this issue and assume a guided tour as basis for an initial mapping process. During such a tour the robotic system needs to be able to detect significant changes in its environment representation - structural ambiguities - to be able to invoke a clarification discourse with the user. In this paper we present our approach to the detection of such ambiguities, that is independent from prior specification and training of particular spatial categories. We evaluate our method on data sets obtained during several runs in indoor environments in the context of a guided tour scenario.
In Proceedings of the IEEE International Conference on Robotics and Automation, Pasadena, CA, USA, May 2008. to appear.
The future of robots, as our companions is dependent on their ability to understand, interpret and represent the environment in a human compatible manner. Towards this aim, the presented work is part of an attempt to create a hierarchical probabilistic concept-oriented representation of space, based on objects. Specifically, this work details efforts taken towards learning and generating concepts and attempts to classify places using the concepts gleaned. Inference is based on the number of occurrences of various objects. The approach is based on learning from exemplars, clustering and the use of Bayesian network classifiers. Such a conceptualization and the representation that results thereof, will enable robots to be more cognizant of their surroundings and yet, compatible to us. Experiments on conceptualization and place classification are reported. Thus, the theme of the work is - conceptualization and classification for representation and spatial cognition.
In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), San Diego, USA, October 2007.
With the progress in research and product development humans and robots get more and more close to each other and the idea of a personalised general service robot is not too far fetched. Crucial for such a service robot is the ability to navigate in its working environment. The environment has to be assumed an arbitrary domestic or office-like environment that has to be shared with human users and bystanders. With methods developed and investigated in the field of simultaneous localisation and mapping it has become possible for mobile robots to explore and map an unknown environment, while they can stay localised with respect to their starting point and the surroundings. These approaches though do not consider the representation of the environment that is used by humans to refer to particular places. Robotic maps are often metric representations of features that could be obtained from sensory data. Humans have a more topological, in fact partially hierarchical way of representing environments. Especially for the communication between a user and her personal robot it is thus necessary to provide a link between the robotic map and the human understanding of the robot's workspace. The term Human Augmented Mapping is used for a framework that allows to integrate a robotic map with human concepts. Communication about the environment can thus be facilitated. By assuming an interactive setting for the map acquisition process it is possible for the user to influence the process significantly. Personal preferences can be made part of the environment representation that the robot acquires. Advantages become also obvious for the mapping process itself, since in an interactive setting the robot could ask for information and resolve ambiguities with the help of the user. Thus, a scenario of a "guided tour" in which a user can ask a robot to follow and present the surroundings is assumed as the starting point for a system for the integration of robotic mapping, interaction and human environment representations. Based on results from robotics research, psychology, human-robot interaction and cognitive science a general architecture for a system for Human Augmented Mapping is presented. This architecture combines a hierarchically organised robotic mapping approach with interaction abilities with the help of a high-level environment model. An initial system design and implementation that combines a tracking and following approach with a mapping system is described. Observations from a pilot study in which this initial system was used successfully are reported and support the assumptions about the usefulness of the environment model that is used as the link between robotic and human representation.
Robots are rapidly evolving from factory work-horses to robot-companions. The future of robots, as our companions, is highly dependent on their abilities to understand, interpret and represent the environment in an efficient and consistent fashion, in a human compatible manner. The work presented here is oriented in this direction. It proposes a hierarchical probabilistic con- cept oriented representation of space that is based on typical household objects and structural elements such as doors (and walls). The primary contributions of this work are in the areas of representation and conceptualization for mobile robots; the key focus being, an increase in the semantic content of state-of-the-art mobile robot spatial representations and an increase in the spatial awareness (understanding) of a mobile robot about its surroundings. Thus, the theme of the work is - representation and conceptualization for spatial cognition in mobile robots. The work itself is holistic in that it also studies other related facets of the problem - these include perception and user studies (a cognitive validation of the thesis).
Autonomous world modeling is one of the major topics in current robot research. A basic concept hereby is the registration of consecutive range images. Consistent models can only be built with robust registration methods. Already one incorrect registered range image will affect the following registrations and lead to an inconsistent model. Therefore we developed a robust ICP registration method based on an octree matching strategy. This matching strategy could cope with large odometry errors and achieved the generation of consistent 3D models.
The usage of geometrical 3D-models enables humans to plan and design different aspects of indoor environments. This article describes the autonomous collection of 3Ddata using a laser scanner and monocular vision. Furthermore the constructed 3D-model can also be used by other autonomous robots for navigation and localization tasks. The process of building a 3D-model without any geometical preknowledge leads due to odoemetric and sensor uncertainties to the known simultaneous localization and mapping (SLAM) problem. This article copes with the SLAM problem by using 6DoF-registration and loop closing combined with an intelligent action planning scheme for indoor environments. Color information from a monocular vision system is used for registration and texture mapping.
This paper is an invitation to use mono-vision techniques on stereo-vision equipped robots. The advantages of both mono-vision (bearing-only, with infinity range but no 3D instant information) and stereo-vision (3D information only up to a limited range) naturally add up to provide interesting possibilities, that are here developed and demonstrated using an EKF-based monocular SLAM algorithm. Mainly we obtain: a) fast 3D mapping with long term, absolute angular references; b) great landmark updating flexibility; and c) the possibility of stereo rig extrinsic self-calibration, providing a much more robust and accurate sensor. Experimental results show the pertinence of the proposed ideas, which should be easily exportable (and we encourage to do so) to other, more performing, visionbased SLAM algorithms.
Proc. IEEE International Conference on Robotics and Automation (ICRA), Roma, Italy, April 2007.
Some recent stereo matching algorithms are based on graph cuts. They transform the matching problem to a minimisation of a global energy function. The minimisation can be done by finding out an optimal cut in a special graph. Different methods were proposed to construct the graph. But all of them, consider for each pixel, all possible disparities between minimum and maximum values. In this article, a new method is proposed: only some potential values in the disparity range are selected for each pixel. These values can be found using a local analysis of stereo matching. This method allows us to make wider the disparity range, and at the same time to limit the volume of the graph, and therefore to reduce the computation time.
Proc. IEEE International Conference on Image Processing (ICIP 2007), San Antonio (USA), September 2007.
POP - Perception On Purpose POP was a 3-year scientific project (January 2006 - December 2008) granted by the European Commission under Cognitive Systems. EC project number is FP6-IST-2004-027268. The main scientific results of POP will be presented at CogSys 2010, the 4th International Conference on Cognitive Systems, January 27 & 28 2010, ETH Zurich, Switzerland. POP on ICT-results : Robotic perception, on purpose The final report (May 2009) is available here : POP Final report. Project coordinator and contact : Radu Horaud, INRIA Grenoble Rhône-Alpes, France Partners : INRIA, U. Osnabrück, U. Hospital Hamburg-Eppendorf, U. Coimbra, & U. Sheffield (see below for more details or click on the "Partners" button). POP’s experiments : EEG & eye-tracking, POPEYE - the audiovisual robot head, Multiple speaker localization Summary The ease with which we make sense of our environment belies the complex processing required to convert sensory signals into meaningful cognitive descriptions. Computational approaches have so far made little impact on this fundamental problem. Visual and auditory processes have typically been studied independently, yet it is clear that the two senses provide complementary information which can help a system to respond robustly in challenging conditions. In addition, most algorithmic approaches adopt the perspective of a static observer or listener, ignoring all the benefits of interaction with the environment. This project proposes the development of a fundamentally new approach, perception on purpose, which is based on 5 principles : visual and auditory information should be integrated in both space and time. active exploration of the environment is required to improve the audiovisual signal-to-noise ratio. the enormous potential sensory requirements of the entire input array should be rendered manageable by multimodal models of attentional processes. bottom-up perception should be stabilized by top-down cognitive function and lead to purposeful action. all parts of the system should be underpinned by rigorous mathematical theory, from physical models of low-level binocular and binaural sensory processing to trainable probabilistic models of audiovisual scenes. These ideas will be put into practice through behavioural and neuroimaging studies as well as in the construction of testable computational models. A demonstrator platform consisting of a mobile audiovisual head will be developed and its behaviour evaluated in a range of application scenarios. Project participants represent leading institutions with the expertise in computational, behavioural and cognitive neuroscientific aspects of vision and hearing needed both to carry out the POP manifesto and to contribute to the training of a new community of scientists. Partners INRIA (Computer vision and statistics groups) - Project coordinator and partner 1 ; University of Osnabrück (Institute of Cognitive Science)- Partner 2 University Hospital Hamburg-Eppendorf (Institute of Neurophysiology and Pathophysiology) - Partner 3 University of Coimbra (Institute for Systems and Robotics) - Partner 4 University of Sheffield (Speech and Hearing group) - Partner 5
PowerWise Microphone Amp with Far-Field Noise Suppression
Transform audio design with the LMV1088 dual-input microphone array amplifier that draws only 1 mA of supply current with a power supply rejection ratio (PSRR) of 85 dB at 1 KHz, a typical signal-to-noise ratio (SNR) of 60 dB, and less than 1 percent total harmonic distortion plus noise (A-weighted THD+N).
This small autonomous robot is able to turn its camera towards a sound source by comparing time and intensity differences at its two ears. The robot was developed within the European POP (perception on purpose) project.
Acoustics Computing for AI/AmI Applications [ACAIA.org]
FP7 ICT6 2.1 Cognitive Systems STRET proposal 13Apr10 (-58d)
Invitation to Research effort
2009.2.1 a) New approaches towards understanding and solving
key issues related to the engineering of artificial cognitive systems
AUDIOBOT (SOUNDBOT)Artificial Audio Cognitive System serving Audition sense and Auditory capacity to Robot
Dear attendant of Jan2010 LUX InfoDay;
Briefly; I have had some exchange with our working team of Avcoustics Computing, we made progress but there are lacks in the robotics and cognitive systems workpackages and partners of the consortium, and I am concerned that time is running out (-58d).
I attach a brief overview of the project concept. This outlines the background, the general concept and some ideas regarding work packages. Please note that this is preliminary, and the WPs need to be developed further, and sub-tasks identified. Please also remember that we are answering a robotics call, where the problem is how to achieve artificial cognitive systems for robotics.
I have some short but important questions:
1. Are you prepared to participate? 2. What can you contribute by way of expertise? I know the concept is very brief, and may not yet be self-explanatory. 3. Are you prepared to act as WP leader? This means taking responsibility for developing the WP concept and planning the sub-tasks. 4. Are you available for a skype conference later this week? I suggest Friday, at 12:00 CET, If this is not possible, please suggest another date and time.
I look forward to hearing from you shortly, and to an intensive period of proposal writing! However, we believe the concept is both novel, relevant and feasible. I suggest we aim for a STREP big-size project, with a budget in the region of (max) 4.5 M€ including our own matching funding and the EC contribution.
Feel free to call me if you want to discuss this, my number is given below.
Please, don't hesitate to contact me, for any question raised form this email,
Best regards
Ferran
----------
The name AUDIOBOT is a concatenation of two radical taken from the words AUDIO and roBOT.
AUDIOBOT is envisaged as a robot with ears, natural hearing and listening behaviours, auditory capacity and audition sense to detect acoustics events, recognise activities and multi-sensory reasoning for interaction to provide high reliable information services for human collaboration and object cooperation.
AUDIOBOT acts as human listening capable of performing a variety of complex human tasks on command via voice/graphical user interfaces or programs via the platform.
The first research LISTENOR focus is on audition sense and auditory capacities, reception, natural behaviours and streaming of natural environmental sounds and noise.
The second research AUDIOBOT focus is on the capacity of processing of acoustics in order to identify events and recognise activities. This include two main areas: the audio-ontology development and the (provision of meaning) to the audio streaming in a protocol of audio graphical representation, categorisation and processing.
The third research SOUNDBOT focus is on sounds understanding to interact and collaborate for planned goals. This include the Bayesian processing and risk management and take decision tools.
Finally the four main research focus of AUDIOBOT is on the self-learning and external-training of the audition capacities of robots and robotics machineries. This include the Internet of Sound Architectures and Services of acoustics training.
AUDIOBOT benchmark the safe human-robot interaction and collaboration improvement by a better understanding of the external environment, for the major information and high quality of multi-sensory and contextual reasoning.
AUDIOBOT refers to an open standard audio-robot to provide reliable information supportive services for interaction and human collaboration and cooperation in Ambient Intelligent applications such as homes and/or remote monitoring services.
---------------
The consortium raised on before proposals and in the conference “State of the Art, Opportunities and Challenges of Acoustic Computing for Artificial and Ambient Intelligence Applications to provide world-wide innovative Information Society services” on Set09 in BCN
The community is multidisciplinary and involves the following technologies:
Artificial Audio Cognition systems; Audition sense and Auditory capacities for robotics; Audio-Sensing and Audio-Sensors; Audio (sounds-noises) computing (acquiring, processing, understanding, learning and reasoning); Audio-sensors networks; Acoustic Events Detection (AED); Audio representation and semantic classification; Internet of Sounds and services; Audio-sensors networks; Activity Recognition and goal collaboration; Audio-Services and Solutions and Social activities and other not mentioned;
Initial core team consortium:
Num.
Partner name
code
state
1
Muficata s.l.
MUF
ESP
2
Fraunhofer-Gesellschaft Forderung der angewandten Forschung (IDMT)
FHG
GER
and the following application for provide information (monitoring, reporting, informing, warming, alerting or alarming) assistive and supportive services for human:
Tele-monitoring and surveillance in Hearing remote temporal, moving and heterogeneous environments; Public and Personal Security and Privacy; Inclusion and Health; Mobility and Transit; Energy-efficiency; Logistics and Manufacturing and other domains and industrial sectors still not listed.
AUDIOBOT design, develop, evaluate and benchmark Artificial Audio Cognitive System able to learn and understand the meaning of sounds, reasoning activities and optimising reactions, in base of identify events and recognise contextual situations though acoustic processing in hearing and listening environments.
------------------
Main Work-packages
WP0 Project management and coordination (MUF)
WP1 Definition, concretion and detailed of works and objectives.
1) Involvement of end-users and experts for collaboration in the design, development and evaluation.
WP2 Audio-sensors and Audio acquisition, streaming and processing (IDMT)
1) Design and development of a interactive Mechatronic crown LISTENOR of two or three dual pair-microphones acting in two behaviours relaxed (omnidirectional) and inter-active (directional, tracking or focused on one point)
2) Design the audio-system to allow two main behaviours:
passive/relaxed for static hearing (omnidirectional sounds)
active/focused listening (addressing a specific or interesting audio points) such as a conversation or the main direction of a track or path when moving
3) Build a functional sounds and noises acquiring prototype, similar to plastic crown of two pair of double microphone situated in the opposite poles for fitting in the head, of an assistant robot.
The audio system, include the software to optimise the streaming, compilation, integration and multi-pre-processing of the audio-information acquired by the diverse microphones.
WP3 Audio meaning and understanding (representation and semantics) (IDMT)
Audio data analysis for situational awareness and detection of normal and abnormal activity. Data input from multiple microphones. Sounds and Event tracking.
Audio-Data must/can also be analysed for identifying and non-identifying activities and either saved as separate datasets or not saved. Audio-streaming data from previous time be “saved” depending on level of readiness, from minutes to days.
1) Design and Develop a graphical system to describe Acoustic Events as graph of sounds that has the information to be identify when the acoustic filtering process an audio stream;
2) Design, develop and implement a Semantic Library and Gallery of “Audio-Pictures” pieces of software that describe and filter a specific acoustic event.
3) Design and Evaluation of trust and reliability
WP4 Artificial Audio cognition system (AACS) for robotics
WP5 Multi-sensory interactions for collaboration with human
1) Developing syntax language for interacting with autonomy to build and develop a Bayesian framework for probabilistic control be able of take decision and extract solutions few prior structural knowledge in uncertain situations and complex dynamical environments and situation Assessment and conveying information within uncertainty
Enhancing User understanding of autonomous systems though the application of mental models
Designing architectures for complex autonomous systems (and integration of sub- systems)
Approaches to provide information to assist in design in Agent- based models
Knowledge Elicitation techniques (Semantic mapping) as tools to provide insight into building Agent behaviours and values
2) Development of autonomous robot teams to show collaborative behaviours with minimal user input, scaled for priority to allow dynamic goal- directed inject from user
WP6 Training and Learning Audition Sense (reasoning and interaction frameworks)
Learning Artificial Audio Cognition system to improve the quality of the audio perception of robots in order to recognise better the contextual situation and the understanding of tasks. We need a Internet of things and Services to training robots. The process initially designed could be collect all the abnormal sounds processed and feed in a internet semantic database to feedback the robots to upgrade software and improve the audition capacities (quality and quantity of activities recognized).
- Creating an artificial system, based on Internet services, capable of learning actions based on human audition sensory abilities to collect, process and classified AE to retrofitting the robot in upgrade packages
- Building a semi-driven system for deciding which actions are ore suitable in every situation of a collaborative tasks to training the robot. This will use collective behaviour arising from the interplay of individuals that recommends the “best action” for the current state of the tasks and combines hand-on trained systems and models models to guide the learning actions process
WP7 Benchmarking and evaluation for human collaboration in real-cases of task
1) Design Human Interaction for evaluation of human collaborative tasks
2) Drawn the situation and comparative assessment in different technological areas of the AUDIOBOT and robotics:
- Improvement of Positioning, Situation Aware and Trajectory Tracking capacities for autonomous Navigation in complex real environments
- Precision of the detection of events and recognition of activities and definition of the context
- Definition of contextual planned goals
3) Comparative evaluation of Reliability & Quality of service (mono – audio and video -, dual – audio plus video, and three senses – audio, video and gyroscope (speed perception) in specific collaborative tasks
4) Final and global assessment of human performance within Cognitive systems
WP8 AACS info-space and community for robotics and ICT solutions in application sectors
An Internet of Sounds and Services will be implemented to provide audio learning and training systems.
WP9 Dissemination and sustainable exploitation of AACS for industrial/sectoral applications
AUDIOBOT is opening and introducing an wide-world innovation and European “added-value” in the domain of sounds for robotics in particular in their capacities for listening, in the enhancement of robotics of recognition of events and understanding contextual situations. Additionally, this value refers to the speciality and multidisciplinary of the works and partners that cannot be found of the European level.
AUDIOBOT implies at least:
1) Leading-edge research capacity in Europe in audio cognitive systems engineering and robotics.
2) World-wide Innovation in service robots, and industrial production and manufacturing processes, in particular in the sector of telesurveillance, security and protection, Inclusion and Healthcare.
3) A new widespread comparative assessment of audition capacities for robot performance in different applications, collaborative tasks and technologies.
4) A new ICT of acoustics processing for AI/AmI market opportunities, and technologies and applications and services for increased productivity and efficiency in EU industries and administrations.
--
en Ferran Cabrer i Vilagut
Identity, diversity and complexity for sustainability
[CA] Cabrer.cat Ferran @ Cabrer.cat
[EN] Cabrer.org Ferran.Cabrer @ MUFICATA.net
_______________________________________________________________________
MUFICATA s.l. - Natural Information Environments
Scientific Prospectors, Technological Envisioners, Germinal Ideas,
Conceptual Developers, Consortia Builders, Collaborative Netwokers,
Innovative Performers, Responsible Managers and Sustainable Maintainers.
Skype/Google: Ferran.Cabrer T:+34 93 423 8267 M:+34 6222 54458
Jaume Fabra 12 08004 Barcelona (Catalonia)
Acoustic Computing for Artificial and Ambient Intelligent Applications