Search Results for author: Oriana Riva

Found 9 papers, 4 papers with code

AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents

1 code implementation23 May 2024 Christopher Rawles, Sarah Clinckemaillie, Yifan Chang, Jonathan Waltz, Gabrielle Lau, Marybeth Fair, Alice Li, William Bishop, Wei Li, Folawiyo Campbell-Ajala, Daniel Toyama, Robert Berry, Divya Tyamagundlu, Timothy Lillicrap, Oriana Riva

Finally, we conduct a robustness analysis by testing M3A against a range of task variations on a representative subset of tasks, demonstrating that variations in task parameters can significantly alter the complexity of a task and therefore an agent's performance, highlighting the importance of testing agents under diverse conditions.

Benchmarking

Latent State Estimation Helps UI Agents to Reason

no code implementations17 May 2024 William E Bishop, Alice Li, Christopher Rawles, Oriana Riva

In the context of autonomous UI agents we then show that LLMs used in this manner are more than $76\%$ accurate at inferring various aspects of latent state, such as performed (vs. commanded) actions and task progression.

UINav: A Practical Approach to Train On-Device Automation Agents

no code implementations15 Dec 2023 Wei Li, Fu-Lin Hsu, Will Bishop, Folawiyo Campbell-Ajala, Max Lin, Oriana Riva

Automation systems that can autonomously drive application user interfaces to complete user tasks are of great benefit, especially when users are situationally or permanently impaired.

Android in the Wild: A Large-Scale Dataset for Android Device Control

3 code implementations19 Jul 2023 Christopher Rawles, Alice Li, Daniel Rodriguez, Oriana Riva, Timothy Lillicrap

The dataset contains human demonstrations of device interactions, including the screens and actions, and corresponding natural language instructions.

Lexi: Self-Supervised Learning of the UI Language

1 code implementation23 Jan 2023 Pratyay Banerjee, Shweti Mahajan, Kushal Arora, Chitta Baral, Oriana Riva

Along with text, these resources include visual content such as UI screenshots and images of application icons referenced in the text.

Image Retrieval Language Modelling +2

Inducing a hierarchy for multi-class classification problems

no code implementations20 Feb 2021 Hayden S. Helm, Weiwei Yang, Sujeeth Bharadwaj, Kate Lytvynets, Oriana Riva, Christopher White, Ali Geisa, Carey E. Priebe

In applications where categorical labels follow a natural hierarchy, classification methods that exploit the label structure often outperform those that do not.

Classification Clustering +2

Bew: Towards Answering Business-Entity-Related Web Questions

no code implementations10 Dec 2020 Qingqing Cao, Oriana Riva, Aruna Balasubramanian, Niranjan Balasubramanian

We present a practical approach, called BewQA, that can answer Bew queries by mining a template of the business-related webpages and using the template to guide the search.

Cannot find the paper you are looking for? You can Submit a new open access paper.