Anagha V | NID | Interaction Designer

Copilot

A new touch to Microsoft Copilot

Multimodality

Redesign

Objective

To identify areas for improvement in the copilot AI assistant and to devise a redesign solution that enhances usability by integrating multimodal interactions.

Scope

Interventions in the multiple modal interactions such as gestures and haptic feedback into Copilot can significantly enhance the user experience.

2 weeks

Duration

UX research

UX/ UI mobile design

Prototyping

Responsibilities

Figma

Tools

/ Process

The entire process included three phases - Research phase, Ideation phase and Execution phase

RESEARCH

Desk & primary research

Existing task flow

Identifying gaps

IDEATION

Gather insights

Problem statement

Potential ideas

EXECUTION

Wireframe

Screen prototype

Prototype testing

RESEARCH

/ Desk Research (Understanding more about Copilot)

After the introduction of AI in doing day to day life tasks, people have started depending on generative AI software to increase productivity. Microsoft Copilot app having more than 10Million downloads, was created to help users with a variety of tasks, including text generation, document summarization, to-do list creation, and email management. Microsoft’s Copilot AI takes technology from OpenAI, including ChatGPT and Dall-E, and blends it with its own web data from Bing.

One of the significant challenges existing in the application is the image generation experience. Recognizing this need for enhancement, I’ve aimed to address this particular challenge.

/Feature Comparison (Understanding the features in other generative AI applications)

RESEARCH

Uses Google’s Imagen 2 model for generating images based on text prompts.

Does not provide built-in image editing capabilities.

Easy download from the chat page itself.

No option to save/ collect favorite images

Possible to view one image after another while viewing full screen.

Images can be saved but there is no accessibility to these saved images.

Not possible to view one image after another while viewing full screen.

Need to go to the edit page to download the image.

Provide built-in image editing capabilities.

Uses DALL-E 3 from OpenAI for generating images based on text prompts.

RESEARCH

/Existing Task Flow (Finding gaps or opportunities for intervention)

Voice Input

Text Input

Chat Screen

Camera Screen

Image Input (Camera & Gallery)

VOICE INPUT FOR SEARCHING DOES NOT WORK SIMULTANEOUSLY WHEN ACCESSING CAMERA

LACK OF FEEDBACK (OTHER THAN VISUAL) TO INDICATE THAT IMAGE IS BEING GENERATED WHEN THE APP IS KEPT OPEN IN THE TAB

GAPS IDENTIFIED ARE NOTED DOWN IN STICKY NOTES

Image Input via camera

Searching with reference via text/ voice input

Selecting any image for reference

Select from Gallery

Generating images using an image reference

Generated Images (Four by default)

Icon to tap to “Save to Collections

Selecting one image to add it to “Save to Collections”

Selecting & Closing the pop-up to select the next image

NO WAY TO MULTISELECT IMAGES IN ONE-GO TO ADD THEM TO " Save to Collections "

Adding 2 generated images to “Save to collections”

Tap on the image to select

13:57

Designer

NO WAY TO VIEW THE ENLARGER VERSION OF THE IMAGE WITHOUT GOING TO THE EDIT SCREEN

Art style options to choose from

Edit Screen

Chat Screen

Tap the kebab menu to view options

13:57

Designer

Download

More information

MULTIPLE STEPS TO JUST DOWNLOAD OR SHARE ONE IMAGE

Download

More information

Close the edit page to go back and view other images

NEED TO GO BACK AND FORTH BETWEEN THE CHAT SCREEN AND EDIT SCREEN TO VIEW THE ENLARGED VERSION OF AN IMAGE

Designer

“Create an image of a top view of a small round table with... “

Downloading one of the generated images

/ User Reviews (Validating the identified problem & getting further inputs)

RESEARCH

Collecting reviews from real users

google play reviews

Identifying relevant reviews for the research

No toggle between landscape and portrait mode as default

Maybe ai itself can adjust the canvas ratio for images

No way to upload a pdf doc to ask questions regarding it

Textbox obscures the view while generating an answer

No response from ai, forcing to start a new conversation or to leave the app entirely

Images created during a session is not accessible afterward

Can’t find saved images.

Need to go to editor to download an image

Selecting some reviews for design considerations

No toggle between landscape and portrait mode as default

Maybe ai itself can adjust the canvas ratio for images

No way to upload a pdf doc to ask questions regarding it

Textbox obscures the view while generating an answer

No response from ai, forcing to start a new conversation or to leave the app entirely

Images created during a session is not accessible afterward

Can’t find saved images.

Need to go to editor to download an image

It would be better if I didn’t have to click a picture and then use voice search later. Maybe AI can listen to me when I point at something with the camera and ask questions. I don’t know. Something like that might work better.

It is tough when I want to add all the images to the collections at once. I’ll have to add one at a time. And I can’t find these images later. I don’t know where to look for it.

Whenever I want to see an image better, I must select, edit, and return to select the other image. Imagine doing it for, like... four times continuously. It’s frustrating for me. Copilot could’ve done better.

/ User Interviews (Validating the identified problem & getting further inputs)

RESEARCH

After the desk research with basic feature comparison, task flow study and user review analysis, I went on with interviewing a few people who have used or is still using copilot mobile app. Below are some of the most critical feedback from th interviews.

/ Observations (Insights from observing user interactions with Copilot)

RESEARCH

The user tapped on the same image more than once because she missed the fast stand-alone visual feedback.

The user is only able to select the camera option after multiple tries.

First-time users try to get the next image using the known gesture of swiping following (like swiping images in your phone gallery.

To download a generated image without going back and forth between chat and edit screens

Real time inputs to search when using camera instead of going to chat screen for giving audio input

Proper confirmation feedbacks for improving user engagement and error reduction

To view one image after another while viewing the image on full screen

Option for enlarged preview of one image without limiting the view of the other generated images

To select more than one image at a time to add them to “save to collections”

/User Needs (Needs identified from secondary research)

/Problem Statement (Defining the problem after secondary research)

Upon completing my research, I zeroed in on a pivotal question:

How can we integrate effortless and straightforward interactions into the image generation process of the Microsoft Copilot mobile app to improve user experience?

Haptic Feedbacks for Confirmation: Providing apt haptic feedbacks along with existing visual feedbacks for improving user engagement and error reduction.

Real-Time Voice Search: Integrating real-time voice search into the camera feature to reduce user effort by allowing audio input directly, without the need to first take a picture and then switch to the chat screen.

Multiple Image Selection: Adding the option to select multiple images at once for ‘Save to Collections,’ reducing the steps needed when users want to choose more than one image.

Magnified Image Preview: Enabling image enlargement for users to view all images without toggling between chat and edit screens.

Direct download & Share: Providing option to download or share images directly in chat screen, skipping the need to visit the edit screen for the task.

Gesture to access camera: Enabling camera access through touch gestures to reduce fat finger errors and minimize the number of attempts.

Redesign Goals:

Through the desk research, I discerned that users primarily focus on three key aspects: generating images, adding images to “save to collections”, downloading/ sharing an image. Focusing on these key aspects, I formulated the following redesign goals for Copilot:

IDEATION

/Redesign Goals (Goals formed from the identified user needs)

/ Initial Ideations (Exploring various solutions to meet the redesign objectives)

IDEATION

Image preview

Long press and hold for preview

Image multi-selection

Long press one of the icons and drag to select the rest of the images for multiselecting

Can the preview size be increased further for a better image view?

Is there a need for one more step to confirm selections?