Google Summer of Code 2020
03 Feb 2020
General information on applications
This is the list of ideas for Google Summer of Code 2020. If you are interested in any of the ideas listed below, we encourage you to apply if RoboComp gets accepted into GSoC’20.
- It is important to first familiarize yourself with the software (https://github.com/robocomp/robocomp).
- Please go through the available tutorials and direct your questions to the Gitter chat (links available in the contact section).
- Please read all the information posted on this page before applying.
- Make sure you are familiar with the required skills for the idea.
- To keep this list short, several of the RoboComp tools and components mentioned are not explained here; we encourage everyone to check the RoboComp documentation linked below.
- Mentors and backup mentors are listed right after each idea's explanation. All mentors' contact information is at the end of the page. Contact them directly for specific questions on the idea.
RoboComp installation and tutorials: https://github.com/robocomp/robocomp/tree/stable/doc#tutorials
If you have not worked with Git branching before, we encourage you to visit this website: https://learngitbranching.js.org/
Where can I start and what should I include in my application?
You are encouraged to go through these steps for a better understanding and follow-up of your application:
- Download and install RoboComp: https://github.com/robocomp/robocomp/blob/stable/README.md.
- Follow the tutorials: https://github.com/robocomp/robocomp/tree/stable/doc#tutorials.
- Once you are familiar with RoboComp and the components and tools involved in the particular idea you want to contribute to, try to understand how these components/tools work and, if possible, their design.
- Participate in the GSoC Gitter chat, asking any questions you might have.
- Create a schedule with the milestones you plan to follow during the GSoC 2020 program.
- Send the schedule and any code you might have written, along with your CV and other application materials, to the mentor of your idea and the backup mentor (see the complete list of mentors at the end of this page).
For general questions about RoboComp, please use the Gitter chat.
LIST of IDEAS for GSoC 2020
1. Human-robot dialogue and collaboration for social navigation
Mentors
Juan Carlos García, Pedro Núñez
Brief description
The navigation of a robot in an environment with humans has attracted enormous interest in recent years. To be accepted in these types of scenarios, it is important that the robot navigates while respecting social norms, for example, avoiding getting too close to humans, avoiding interrupting a conversation, or asking permission to pass through a blocked path. This slot aims to develop the dialogue manager, together with the corpus, that allows establishing dialogues between the robot and humans in real situations, improving the behaviour of the robot navigation system and making it more socially acceptable. The dialogue manager should be a RoboComp agent which reads information from the robot's world representation (a graph) and adapts the conversation to the current situation (e.g., according to people's age, gender, etc.). The idea of this slot is to build on open-source technologies such as PyText or Rasa NLU (https://rasa.com/).
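As a rough illustration of how such an agent could adapt its utterances to the world graph, the sketch below selects a greeting from the attributes of a person node. The graph access (get_node) and the attribute names are placeholders, not the actual RoboComp API:

    # Hypothetical sketch: adapt a navigation-related utterance to graph attributes.
    GREETINGS = {
        "child": "Hi! Can I go through here, please?",
        "adult": "Excuse me, may I pass through?",
    }

    def choose_utterance(graph, person_id):
        person = graph.get_node(person_id)          # hypothetical graph API
        age_group = "child" if person.attrs.get("age", 30) < 14 else "adult"
        if person.attrs.get("interacting_with") is not None:
            return "Sorry to interrupt. " + GREETINGS[age_group]
        return GREETINGS[age_group]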
Needed skills
Python, Qt5, Natural Language Processing.
2. Development of graphical user interfaces for robot controlling and monitoring
Mentors
Pedro Núñez, Araceli Vega
Brief description
The complexity of robots increases year by year. Navigating, detecting objects, recognizing people, interacting with humans, etc. are challenges that the scientific community is still trying to solve. However, it is clear that many of these modules do not work properly in most robots, or their functionality is incomplete. In this project, we propose the design of user interfaces that allow the user to teleoperate the robot and to generate inputs/outputs directly (without the need for an implemented module).
Among these functionalities, the user interface must allow sending the robot to a specific position, rotating it, talking to humans (TTS) and entering responses manually (like an ASR would). Many of these actions will be simulated directly by the user, adding all the necessary information to the cognitive architecture of the robot.
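As an illustration only, a minimal PyQt5 sketch of such a panel is shown below; the callbacks simply print, because the actual RoboComp proxies they would call (navigation, speech, etc.) are assumptions here:

    import sys
    from PyQt5 import QtWidgets

    # Minimal teleoperation panel sketch: a go-to-position command and a TTS text box.
    class TeleopPanel(QtWidgets.QWidget):
        def __init__(self):
            super().__init__()
            layout = QtWidgets.QFormLayout(self)
            self.x = QtWidgets.QDoubleSpinBox(minimum=-10000, maximum=10000)
            self.y = QtWidgets.QDoubleSpinBox(minimum=-10000, maximum=10000)
            go = QtWidgets.QPushButton("Go to position")
            go.clicked.connect(self.go_to)
            self.text = QtWidgets.QLineEdit()
            speak = QtWidgets.QPushButton("Speak (TTS)")
            speak.clicked.connect(self.speak)
            layout.addRow("x (mm)", self.x)
            layout.addRow("y (mm)", self.y)
            layout.addRow(go)
            layout.addRow("text", self.text)
            layout.addRow(speak)

        def go_to(self):
            print("goto", self.x.value(), self.y.value())   # would call e.g. a navigation proxy

        def speak(self):
            print("say", self.text.text())                   # would call e.g. a speech/TTS proxy

    app = QtWidgets.QApplication(sys.argv)
    panel = TeleopPanel()
    panel.show()
    sys.exit(app.exec_())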
Needed skills
C++, Python, Qt5.
3. Random generation and storage of the graph-based world representation used in RoboComp
Mentors
Luis V. Calderita, Alberto Serrano
Brief description
Currently, RoboComp keeps the state of the robot and the world around it updated in real time. For this, RoboComp uses a graph-based world representation that evolves over time. This graph represents the robot's perception of its environment and is maintained using the sensors on board the robot.
The aim of this slot is twofold. On the one hand, it is intended to include a tool that can store the evolution of the robot's world representation. To do this, the idea is to represent the graph in JSON format and store it in a document-oriented database such as MongoDB. This first part aims to introduce the student to the concept of world representation used in RoboComp and to RoboComp itself. Once graphs are stored, the second objective is to identify the graph most similar to a given one. To find this similarity among graphs, we intend to use machine learning techniques, specifically Graph Neural Networks. Consequently, we need a tool to generate huge data sets automatically, based on stochastic sampling of a formal spatial grammar encoding all valid worlds. State-of-the-art GNN learning algorithms will be used to obtain a metric over the space of graphs. A final feature of the slot is to facilitate human supervision of the learning process: the representations generated and retrieved will be rendered using a physics-based robotic simulator. Our interest currently lies in the V-REP simulator, already supported by RoboComp, to render the stored graphs and help in the search process.
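As a minimal sketch of the first objective, one snapshot of the graph could be stored and retrieved with pymongo as shown below; the graph structure used here is a simplified assumption of RoboComp's representation:

    import json
    from pymongo import MongoClient

    # Simplified world-graph snapshot: nodes and edges with attributes.
    snapshot = {
        "timestamp": 1580731200.0,
        "nodes": [
            {"id": 1, "type": "robot", "attrs": {"pose": [0.0, 0.0, 0.0]}},
            {"id": 2, "type": "person", "attrs": {"pose": [1.5, 0.3, 0.0]}},
        ],
        "edges": [
            {"src": 1, "dst": 2, "type": "sees"},
        ],
    }

    client = MongoClient("mongodb://localhost:27017")
    collection = client["robocomp"]["world_graphs"]
    collection.insert_one(json.loads(json.dumps(snapshot)))   # round-trip guarantees JSON-serialisable content

    # Later, retrieve the stored snapshots to feed the GNN-based similarity search.
    for doc in collection.find().sort("timestamp"):
        print(doc["timestamp"], len(doc["nodes"]), "nodes")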
Needed skills
Python3, Qt5, MongoDB, JSON, XML, V-REP.
4. Efficient acceptable social behaviour using machine learning techniques
Mentors
Ronit Jorvekar, Pilar Bachiller
Brief description
The detection and implementation of social skills are usually handcrafted by developers writing hardcoded if-then-else constructs in the robots' code. This code usually consists of a series of queries to the robots' world representations (e.g., if the robot is closer than 400 mm to a human, the person might consider the behaviour inappropriate; if two humans are interacting, the robot should not interrupt their visual contact). Although this is a valid approach, it is very time-consuming and scales poorly. This slot aims to improve the efficiency of the component in charge of this, generating a heat map instead of a single value per query. The input used to compute such heat maps will be the robot's world representation (a graph) and the output will be a two-dimensional array. The machine learning techniques used will be Graph Neural Networks and Convolutional Neural Networks. We will be building on top of a similar slot that took place last year. The current implementation is available at https://github.com/robocomp/sngnn.
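A rough sketch of the GNN part of this pipeline is shown below, using DGL and PyTorch: the scene graph is encoded with graph convolutions, pooled into a graph-level embedding and decoded into a two-dimensional grid. Layer sizes, the grid resolution and the linear decoder are illustrative choices, not the architecture used in sngnn:

    import dgl
    import torch
    import torch.nn as nn
    from dgl.nn import GraphConv

    class SocialHeatmapSketch(nn.Module):
        def __init__(self, in_feats, hidden=64, grid=32):
            super().__init__()
            self.conv1 = GraphConv(in_feats, hidden)
            self.conv2 = GraphConv(hidden, hidden)
            self.decoder = nn.Linear(hidden, grid * grid)
            self.grid = grid

        def forward(self, g, feats):
            h = torch.relu(self.conv1(g, feats))
            h = torch.relu(self.conv2(g, h))
            g.ndata["h"] = h
            hg = dgl.mean_nodes(g, "h")                 # graph-level embedding
            return self.decoder(hg).view(-1, self.grid, self.grid)

    # Toy scene: two nodes (robot, person) connected in both directions.
    g = dgl.graph(([0, 1], [1, 0]))
    feats = torch.rand(2, 8)                            # 8 made-up node features
    heatmap = SocialHeatmapSketch(in_feats=8)(g, feats) # shape: (1, 32, 32)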
Needed skills
Python, DGL/PyTorch Geometric.
5. RoboCompDSL: new functionalities for more resilient components
Mentors
Pablo Bustos, Esteban Martinena
Brief description
Currently, RoboComp uses a code generation tool, RoboCompDSL, to generate C++ and Python components. This tool uses two different domain-specific languages, one to define components (CDSL) and another one to define interfaces (IDSL). We have seen that the more code is generated automatically, the fewer errors a programmer makes when creating components. In this task, we want to extend these languages to include features that will make components more resilient. An example of what we have in mind for IDSL files (the bracketed annotations are the new language features):
    struct Data
    {
        int cameraId[range=0:10];          // asserts to control the range of parameter values
        int width[range=320,640,1280];
        int height;
        int depth;
    };

    interface Camera[pub-sub]
    {
        void newPeopleData(Data people[buffer=on]);   // thread-safe interchange buffer in the final code
    };

    interface Robot[action]    // all declared actions must be asynchronous and provide a getStatus method
    {
        void gotoPoint(int x, int y, float angle[range=-pi:pi]);
        Status getStatus();
    };

    interface Camera[query]
    {
        Image getImage();
    };
Needed skills
Python, C++11, RoboComp, Formal languages.
6. Beautify LearnBlock
Mentors
Iván Barbecho, Luis V. Calderita
Brief description
LearnBlock is an educational programming tool developed for learning programming through robotics. It has been designed to facilitate the learning process starting with a visual language and progressing towards a professional programming language. LearnBlock includes many interesting features: robot-agnostic design; blocks can be created from code without modifying the core code of the tool; robots can be programmed using different languages (visual, Block-Text or Python); a program can be run and interrupted from the tool itself, ensuring proper stop and disconnection of the robot.
To make LearnBlock more appealing, this project aims to beautify the tool by creating new shapes for the blocks composing the visual language. Thus, blocks will be redesigned and integrated into LearnBlock. In addition, new types of connections will be defined to reduce potential syntactical errors. Besides these changes, a new and more intuitive method for changing the input parameters of the blocks (for instance, rotation and translation speed in a block in charge of moving the base of the robot) will be defined and implemented. All these modifications will be included with little impact on the core code of the tool.
Repository
https://github.com/robocomp/LearnBlock
Needed skills
Python3, Qt.
7. Syntax-error highlighter for LearnBlock
Mentors
Pilar Bachiller, Iván Barbecho
Brief description
LearnBlock is an educational programming tool developed for learning programming through robotics. It has been designed to facilitate the learning process, starting with a visual language and progressing towards a professional programming language. Specifically, the user can choose whether to create a program from blocks, from the textual representation of blocks (Block-Text), or by coding in Python. The final code executed by the robot is Python code. This final code is generated from the Block-Text code created by the user or obtained from the visual code. If a syntax error is found in the Block-Text code, the Python program is not generated and LearnBlock informs the user that an error was found. However, no message about the specific error is shown.
This project aims to endow LearnBlock with the ability to determine the different syntax errors of a program and to display those errors in both the visual program and the Block-Text code. The parts of the code containing a syntax error will be highlighted. In addition, information about the kind of error and some guidelines on how to correct it will be displayed when the user asks for it (for instance, by clicking on a highlighted statement or block).
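As a hint of the mechanics involved, PyParsing reports the exact location of a parse failure, which is the information needed to highlight the offending block or statement. The tiny grammar below is purely illustrative, not the real Block-Text grammar:

    import pyparsing as pp

    # Toy grammar: a program is a sequence of calls like moveBase(10, 20).
    ident = pp.Word(pp.alphas, pp.alphanums + "_")
    number = pp.pyparsing_common.number
    call = ident + pp.Suppress("(") + pp.Optional(pp.delimitedList(number)) + pp.Suppress(")")
    program = pp.OneOrMore(call)

    def check(code):
        try:
            program.parseString(code, parseAll=True)
            return None                                    # no syntax error found
        except pp.ParseException as err:
            # err carries the location needed to highlight the offending text
            return {"line": err.lineno, "column": err.col,
                    "text": err.line, "message": str(err)}

    print(check("moveBase(10, 20)\nturn(90"))              # missing ')': an error location is returned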
Repository
https://github.com/robocomp/LearnBlock
Needed skills
Python3, Qt, PyParsing
8. Hand gesture recognition
Mentors
Aditya Aggarwal, Francisco Andrés
Brief description
Hand gesture recognition is very significant for human-robot interaction. It can be used as a tool of communication between humans and robots. It can be further extended to a component that can understand sign language and provide a user-friendly way of communication with the robots.
This component can have a two-stage pipeline. The first stage will involve detecting the hand and estimating its shape and pose. These can be used as features for the second stage, which can be a classifier based on a similarity score. Large public image datasets are available that can be used to train these models.
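A minimal sketch of the second stage is shown below: gestures are classified by comparing a feature vector (for instance, the flattened hand keypoints produced by stage one) against stored templates using cosine similarity. The stage-one feature extractor is assumed to exist and is not shown:

    import numpy as np

    def cosine_similarity(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

    class GestureClassifier:
        # Similarity-score classifier over stored gesture templates (sketch only).
        def __init__(self):
            self.templates = {}                    # gesture name -> list of feature vectors

        def add_example(self, name, features):
            self.templates.setdefault(name, []).append(np.asarray(features, dtype=float))

        def classify(self, features, threshold=0.9):
            features = np.asarray(features, dtype=float)
            best_name, best_score = None, -1.0
            for name, examples in self.templates.items():
                score = max(cosine_similarity(features, e) for e in examples)
                if score > best_score:
                    best_name, best_score = name, score
            return (best_name, best_score) if best_score >= threshold else (None, best_score)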
Needed skills
Python, RoboComp, Tensorflow or PyTorch.
Required Domain Knowledge: Computer Vision.
9. Python3 bindings for the InnerModel library
Mentors
Luis J. Manso, Ramón Cintas
Brief description
InnerModel is RoboComp's geometry and world model representation library. It is used to compute geometric transformations, projections and transformation matrices, and even to visualise the robots' internal world models. Although it is heavily used in most RoboComp components, the Python bindings currently available are very limited and have not been properly tested in Python 3. This slot aims to complete the Python bindings and develop the corresponding unit tests to ensure the API works as expected. The current wrapper implementation uses Boost.Python, but other alternatives could be tried.
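As an example of the kind of unit test the new bindings would need, the sketch below checks a simple transformation; the module name (innermodel) and the method signatures are assumptions based loosely on the C++ API and would have to match the final bindings:

    import math
    import pytest

    innermodel = pytest.importorskip("innermodel")   # skip if the bindings are not built

    def test_identity_transform():
        im = innermodel.InnerModel("simple_world.xml")          # hypothetical constructor
        p = im.transform("world", [0.0, 0.0, 0.0], "robot")     # hypothetical signature
        assert len(p) == 3 and all(math.isfinite(v) for v in p)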
Needed skills
Python, C++.
10. Human recognition (identification) using multi-modal perception system
Mentors
Diego Faria, Aditya Aggarwal
Brief description
Detecting people in an environment is a task of enormous interest in robotics. In addition, the ability to identify a person among a finite number of users is a very interesting challenge for later complex tasks such as personalized interaction, social navigation, etc. This project aims to implement a software agent with RoboComp that allows, from a multi-modal RGBD sensor, the identification of people in the robot's environment. The identification should not only use the faces of the detected people but also other information related to the skeleton or silhouette of the body, and even semantic information previously extracted (or manually introduced through a GUI).
Needed skills
Python, RoboComp.
Required Domain Knowledge: Computer Vision.
11. Kinova Gen3 test components using VREP
Mentors
Pablo Bustos, Francisco Andrés
Brief description
Kinova Gen3 is a state-of-the-art robot manipulator with 7 DoF and a two-finger gripper. It also includes an RGBD camera in the wrist, and all motors have torque sensors. Kinova provides a high-quality programming SDK for Python and C++. We also have Gen3's model for the V-REP simulator. In this task, we want to create RoboComp components that embed Kinova's SDK and offer our well-known interfaces. Also, we want to create a couple of scenarios to test the robot in action. One test scenario can involve classical planning algorithms for the blocks world, and another one can use more recent learning techniques such as Deep Reinforcement Learning. The physics-enabled simulator V-REP will be of great help for remote working and for running learning algorithms.
Needed skills
Python, C++11, RoboComp, V-REP.
12. RoboComp/ROS integration
Mentors
Francisco M. Rico, Luis J. Manso
Brief description
This slot aims to endow robocompdsl (RoboComp's component generation tool) with the ability to generate ROS2 components based on CDSL descriptions and to ensure that the currently available integration with ROS1 works properly. To this end, the student working on this project will update the C++ and Python3 templates and generate a set of unit tests to ensure that the integration is correct and to facilitate the early detection of bugs.
Needed skills
Python3, C++17, RoboComp, ROS, ROS2
13. Automatic and systematic generation of RoboComp components for testing
Mentors
Esteban Martinena, Alberto Serrano
Brief description
RoboComp is a robotics framework based on software components. New components are built using the domain-specific language CDSL. RoboComp currently provides the robocompdsl tool to generate and modify components based on files written in this language.
One of the biggest problems when making changes to the robocompdsl tool, or to any of the domain-specific languages used in RoboComp, is that even a minimal change implies testing many different variations and possibilities of the component generation tool to be sure that no bug has been introduced. This slot is aimed at creating a new tool that uses the CDSL grammar to automatically and systematically create new valid .cdsl files, create empty components from those CDSL files, compile them, and report the results. This way, a full test of component generation could be executed any time a change is introduced to robocompdsl. This tool will be a great addition to the current tools of the RoboComp framework.
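A minimal sketch of such a tool is shown below: it enumerates combinations of CDSL options, writes one .cdsl file per combination, runs the generator and reports the result. The CDSL template and the exact robocompdsl invocation are assumptions and would need to be aligned with the real tool:

    import itertools
    import pathlib
    import subprocess
    import textwrap

    LANGUAGES = ["Cpp", "Python"]
    COMM_TYPES = ["requires", "implements", "subscribesTo", "publishes"]

    CDSL_TEMPLATE = textwrap.dedent("""\
        import "CameraSimple.idsl";
        Component TestComp{idx}
        {{
            Communications
            {{
                {comm} CameraSimple;
            }};
            language {lang};
        }};
        """)

    def generate_and_report(workdir="cdsl_tests"):
        report = []
        for idx, (lang, comm) in enumerate(itertools.product(LANGUAGES, COMM_TYPES)):
            case = pathlib.Path(workdir) / f"case_{idx}"
            case.mkdir(parents=True, exist_ok=True)
            cdsl = case / "test.cdsl"
            cdsl.write_text(CDSL_TEMPLATE.format(idx=idx, comm=comm, lang=lang))
            # The generator invocation below is an assumption about the CLI.
            result = subprocess.run(["robocompdsl", str(cdsl), str(case)],
                                    capture_output=True, text=True)
            report.append((cdsl.as_posix(), lang, comm,
                           "OK" if result.returncode == 0 else "FAILED"))
        return report

    for row in generate_and_report():
        print(*row)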
Needed skills
Python, RoboComp, Formal Languages.
14. Live topic translation for ROS (and others) with System Of Systems Synthesizer
Mentors
Marco A. Gutierrez, Ramon Cintas
Brief description
The System of Systems Synthesizer (SOSS) is a piece of software meant to connect messages sent between different communication middlewares. Up to now, it supports messages from systems like ROS1, ROS2, WebSockets, FIWARE and DDS. The communication is configured in a YAML file and, when SOSS is launched, messages can be directly transferred from one system to another.
RoboComp communications are handled by ZeroC's Ice middleware; by adding Ice support to SOSS we will be able to connect RoboComp components to a wide new range of systems and applications.
Needed skills
Python, RoboComp, C++, Ice, YAML, ROS.
15. Software agent for estimating occupancy in medium and large buildings using RGB cameras
Mentors
Sergio Barroso, Araceli Vega
Brief description
Measuring the number of people (occupancy) in an environment is a problem of great interest in different areas of research. Today there are different techniques to estimate the number of people in a small or medium-sized room or class using different sensors: infrared sensors, RGBD cameras, thermal cameras, CO2 measurement sensors, etc.
The idea of this project is to provide RoboComp with a software agent that, from RGB images, estimates the occupancy of crowded environments. There are several possibilities, but here we consider the use of deep learning techniques for the detection of skeletons, faces, etc., to infer the number of people robustly.
Moreover, the agent must also consider its extension to multiple cameras that register access to a large building from different points of view, together with the logic of occupancy control: who enters and who leaves.
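As a first approximation only (the skeleton- and face-based techniques described above would replace it), people in a single RGB frame can be counted with an off-the-shelf detector from torchvision:

    import torch
    import torchvision

    # Pre-trained COCO detector; class 1 corresponds to 'person'.
    model = torchvision.models.detection.fasterrcnn_resnet50_fpn(pretrained=True).eval()

    def count_people(image_bgr, score_threshold=0.7):
        # image_bgr: H x W x 3 uint8 numpy array (e.g. an OpenCV frame)
        img = torch.from_numpy(image_bgr[:, :, ::-1].copy()).permute(2, 0, 1).float() / 255.0
        with torch.no_grad():
            out = model([img])[0]
        person = (out["labels"] == 1) & (out["scores"] > score_threshold)
        return int(person.sum())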
Needed skills
Python, RoboComp.
16. Creation of DNN controlled components for RoboComp
Mentors
Francisco Andrés, Carlos Muñoz
Brief description
RoboComp is a robotics framework based on software components. Building new components is done using a code generator based on the domain-specific language CDSL. Currently, the user can choose between two models of execution for new RoboComp components: sequential and state-machine controlled. This slot aims to create a new execution model for DNNs. The CDSL language will be extended to include the necessary new keywords and options. These new options will select which run-time tools (TensorFlow, PyTorch, Caffe) and model formats will be included in the new component, along with the necessary dependencies. RoboComp's parser and code generator will be modified to generate Python code supporting the loading, initialization and execution of DNN models. As an optional development, the CDSL language could be further extended to define multi-network structures, such as sequential concatenation of DNN models, to build more complex functions.
Needed skills
Python, RoboComp, TensorFlow, PyTorch, Caffe, formal grammars.
17. Behavior-Trees components for RoboComp
Mentors
Francisco Andrés, Carlos Muñoz
Brief description
RoboComp's components are generated automatically from a concise description written in a DSL named CDSL. Currently, components can be generated by selecting between two control-flow options: sequential and state-machine based. Behaviour trees are a relatively new tool to define complex control flows for robots that are both modular and reactive. They are an alternative to state machines and are in widespread use in the video-game industry. This slot aims at extending RoboCompDSL (the code generator) to include a new control flow based on behaviour trees that can be specified using a simple new DSL and then transformed into C++ or Python code as part of the newly generated component. This new DSL will be expressive enough to represent the basic BT elements, i.e. composite, decorator, leaf, sequence, selector, etc., and efficient code will be generated using existing C++11 or Python libraries. Finally, a set of examples and demonstrators will be recorded using the V-REP robotics simulator.
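To make the target concrete, the sketch below shows the minimal behaviour-tree primitives such a DSL would map to; the generated code would more likely wrap an existing C++11 or Python BT library rather than re-implement them:

    # Minimal behaviour-tree primitives (sketch): leaves run actions, sequences
    # require every child to succeed, selectors take the first non-failing child.
    SUCCESS, FAILURE, RUNNING = "SUCCESS", "FAILURE", "RUNNING"

    class Leaf:
        def __init__(self, action):
            self.action = action
        def tick(self):
            return self.action()

    class Sequence:
        def __init__(self, children):
            self.children = children
        def tick(self):
            for child in self.children:
                status = child.tick()
                if status != SUCCESS:
                    return status          # stop on FAILURE or RUNNING
            return SUCCESS

    class Selector:
        def __init__(self, children):
            self.children = children
        def tick(self):
            for child in self.children:
                status = child.tick()
                if status != FAILURE:
                    return status          # first child that succeeds (or keeps running) wins
            return FAILURE

    # Usage: tick the tree periodically from the component's compute loop.
    tree = Selector([
        Sequence([Leaf(lambda: SUCCESS),   # e.g. "is the target visible?"
                  Leaf(lambda: RUNNING)]), # e.g. "go to the target"
        Leaf(lambda: SUCCESS),             # e.g. "search for the target"
    ])
    print(tree.tick())                     # -> RUNNING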
Needed skills
C++11, Python, RoboComp, Formal Grammars.
18. Graphical, automatic, real-time animation of RoboComp state-machines
Mentors
Ramón Cintas, Esteban Martinena
Brief description
A very important tool of RoboComp is the state-machine code generator. The design and implementation of complex robot behaviour demand specific tools to create and debug highly structured programs during real-time execution. This project proposes extending RoboComp's state-machine code generator to include new derived classes that will provide real-time animation of the program flow. The student will need skills in C++, Qt and Python to create code that displays the state machine as an animated structure in a way that is transparent for the user of the tool. This graphic feature will be optional and non-intrusive for the logic counterpart.
Needed skills
C++, Python, Qt5.
19. Distributed Working Memory for Robotics Cognitive Architectures
Mentors
Francisco Rico, Pablo Bustos
Brief description
Robotics Cognitive Architectures (RCA) define specific layouts to organize and interconnect a functional description of a knowledge-based system. Working memories are the places where information from sensors, actuators and goals is gathered to represent the current state of affairs for the robot. A working memory holds a detailed representation of the robot itself and of the relevant objects and actors in the surrounding scene. Most of this information is provided by Deep Neural Networks after transforming the raw input data. On top of RoboComp, we have created CORTEX, an RCA that uses a spatial graph as its working memory and a set of processes (agents) that contribute information to this graph and read information from it to drive the robot's actuators. The current version of this graph is managed by one of these agents working as an in-memory real-time database. In this slot we want to design a new version of this graph that is completely distributed, using a theory named Conflict-Free Replicated Data Types (CRDT). A data structure created this way only exists as the local copies that each agent maintains, and editing can be done very efficiently using multicast communications. In a certain way, it is similar to a GDrive document. In this slot, we want to create the first prototype of a middleware-agnostic graph using CRDT theory and validate it through a set of CPU- and communication-intensive tests.
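One of the CRDT building blocks the distributed graph could use is a last-writer-wins register per attribute; the sketch below shows only the merge rule (a real implementation would exchange these updates over multicast and combine several CRDT types):

    import time

    class LWWRegister:
        # Last-writer-wins register: merging is commutative, associative and
        # idempotent, so replicas converge regardless of message ordering.
        def __init__(self, agent_id):
            self.agent_id = agent_id
            self.value = None
            self.stamp = (0.0, agent_id)          # (timestamp, agent) breaks ties deterministically

        def set(self, value):
            self.stamp = (time.time(), self.agent_id)
            self.value = value

        def merge(self, other):
            if other.stamp > self.stamp:
                self.value, self.stamp = other.value, other.stamp

    a, b = LWWRegister("agent_a"), LWWRegister("agent_b")
    a.set({"node": "robot", "pose": [0.0, 0.0, 0.0]})
    b.set({"node": "robot", "pose": [1.0, 0.0, 0.0]})
    a.merge(b)
    b.merge(a)
    assert a.value == b.value                     # both replicas converge to the same state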
Needed skills
C++, Concurrent and Distributed Computing.
20. DNN’s for precise manipulation of household objects
Mentors
Pablo Bustos, Pilar Bachiller
Brief description
The intelligent control of humanoid robots (HR) was the reason RoboComp was created. One of the most challenging tasks for an HR is to manipulate objects. Think of grasping an object as a change taking place in the robot's geometry, since it is now extended with the object held: part of the room is now part of the robot. In this slot, we want to explore the possibilities of our new Kinova Gen3 arm to grasp different small household objects. To achieve a successful grasp, the robot has to detect, recognize and model (complete 3D pose) the object it is going to interact with. We want to apply new DNN techniques to transform RGBD images into well-defined poses, so the robot can plan a safe grasp of the object. We will use V-REP to simulate the robot and the scenarios, and validate the results with a set of grasping actions on different objects.
Needed skills
C++, Python, DNN.
21. An Automatic Speech Recognition (ASR) component for RoboComp
Mentors
Sparsh Garg, Pablo Bustos
Brief description
Nowadays, speech plays a significant part in human-robot interaction, e.g. in speaker detection systems or voice command detection. Different ASR modules have been developed so far for use in different speech-input-based applications.
In this project, we want to build a component capable of recognizing the speech of different users that can work offline. One option is the DeepSpeech architecture, an open-source TensorFlow-based speech-to-text engine with reasonably high accuracy that uses state-of-the-art machine learning algorithms. The student will also have to evaluate the quality of the captured signal and of the transcriptions obtained, with a view to the development of future dialogue systems between humans and robots.
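A minimal offline transcription sketch with the DeepSpeech Python package is shown below; the model and scorer file names are placeholders for the released 0.9.x files, and the audio is assumed to be 16 kHz mono:

    import wave
    import numpy as np
    import deepspeech

    model = deepspeech.Model("deepspeech-0.9.3-models.pbmm")
    model.enableExternalScorer("deepspeech-0.9.3-models.scorer")

    def transcribe(wav_path):
        with wave.open(wav_path, "rb") as wav:
            assert wav.getframerate() == 16000 and wav.getnchannels() == 1   # DeepSpeech expects 16 kHz mono
            audio = np.frombuffer(wav.readframes(wav.getnframes()), dtype=np.int16)
        return model.stt(audio)

    print(transcribe("command.wav"))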
Needed skills
Python, any computer vision or NLU library (e.g. PyTorch, TensorFlow), Qt, C++.
22. Integration of an emotion recognition component in RoboComp
Mentors
Daniel Rodriguez, Diego R. Faria
Brief description
Advances in social robotics and the development of new technologies are allowing the introduction of assistant and service robots into our daily lives. To create such applications, a robot should be able to coexist and cooperate with humans in a friendly way. For this to happen, a number of issues need to be solved, including the modelling and recognition of emotions in order to establish an affective loop between users and robots, aiming at proper communication through Human-Robot Interaction (HRI). Emotion perception is a topic pursued by different fields such as psychology, computer science and engineering, including advanced robotics. A robot can be endowed with the ability to analyse verbal and non-verbal behavioural cues displayed by the user to infer the underlying emotions. In this slot, we will design an emotion recognition system for human-robot interaction towards establishing an affective loop between humans and robots. The challenge will be focused on machine learning techniques to autonomously select robust models for real-time emotion recognition. The implemented algorithm will be integrated into RoboComp as part of an HRI framework that can be linked to action/activity recognition or any other robot communication abilities within different scenarios.
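As an example of the perception front-end only (the emotion classifier itself is not shown), dlib can provide the 68 facial landmarks that a model trained on such features would consume; the predictor file is the standard dlib model and must be downloaded separately:

    import dlib
    import numpy as np

    detector = dlib.get_frontal_face_detector()
    predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")

    def landmark_features(image):
        # image: uint8 numpy array (grayscale or RGB)
        faces = detector(image, 1)
        if not faces:
            return None
        shape = predictor(image, faces[0])
        points = np.array([(p.x, p.y) for p in shape.parts()], dtype=float)
        points -= points.mean(axis=0)                                   # translation invariance
        return (points / (np.linalg.norm(points) + 1e-9)).flatten()     # scale invariance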
Needed skills
Python, dlib, user-level machine learning libraries.
Complete list of Mentors:
Pablo Bustos
pbustosATunexDOTes
Associate Professor, RoboLab,
University of Extremadura
Pilar Bachiller
pilarbATunexDOTes
Associate Professor, RoboLab,
University of Extremadura
Luis J. Manso
l.mansoATastonDOTacDOTuk
Lecturer in Computer Science,
School of Engineering & Applied Science, Aston University, UK
http://ljmanso.com
Pedro Núñez
pnuntruATunexDOTes
Associate Professor, RoboLab,
University of Extremadura
Aditya Aggarwal
aditya.aggarwalATstudentsDOTiiitDOTacDOTin
RoboComp Developer
International Institute of Information Technology, Hyderabad
Francisco Andrés
pacoanATunexDOTes
Lecturer, Robolab
University of Extremadura
Ivan Barbecho
ibarbechATalumnosDOTunexDOTes
RoboComp Developer
Sergio Barroso
sbarmirezATgmailDOTcom
Researcher, Robolab,
University of Extremadura
Luis V. Calderita
lvcalderitaATunexDOTes
Researcher, RoboLab
University of Extremadura
Ramon Cintas
rcintasATunexDOTes
Researcher, Robolab,
University of Extremadura
Diego R. Faria
d.fariaATastonDOTacDOTuk
Lecturer in Computer Science,
School of Engineering & Applied Science, Aston University, UK
Juan Carlos García
juancarlos97ggATgmailDOTcom
Researcher, Robolab,
University of Extremadura
Sparsh Garg
ksparsh30ATgmailDOTcom
Robocomp Developer
International Institute of Information Technology, Hyderabad
Marco A. Gutiérrez
marcogATunexDOTes
Robocomp Developer
Ronit Jorvekar
ronitjorvekar007ATgmailDOTcom
RoboComp Developer
Pune Institute of Technology, Pune India
Esteban Martinena
emartinenaATunexDOTes
Researcher, Robolab
University of Extremadura
Carlos Muñoz
cmzar97ATgmailDOTcom
Researcher, Robolab,
University of Extremadura
Francisco M. Rico
franciscoDOTricoATurjcDOTes
Associate Professor,
University Rey Juan Carlos
Daniel Rodríguez
190229717ATastonDOTacDOTuk
PhD Student,
School of Engineering & Applied Science, Aston University, UK
Alberto Serrano
aserranokwATalumnosDOTunexDOTes
Researcher, Robolab,
University of Extremadura
Araceli Vega
avegamagATalumnosDOTunexDOTes
Researcher, Robolab
University of Extremadura