Adds new GoaT NLP UI along with seamless streaming between frontend and servers #33

deepnayak · 2024-09-18T12:50:34Z

Summary by Sourcery

Add a new GoaT NLP UI with seamless streaming between frontend and servers, enhancing user interaction. Introduce new chat features like voice input, code highlighting, and model switching. Update the installation guide and add comprehensive tests for the query pipeline components.

New Features:

Introduce a new GoaT NLP UI with seamless streaming capabilities between the frontend and servers, enhancing user interaction and data processing.
Add a new chat interface with features like voice input support, code syntax highlighting, and model switching for a more interactive user experience.
Implement a new sidebar for chat history and user settings, allowing users to manage their chats and preferences easily.
Introduce a new testing guide in the documentation to help developers run tests efficiently.

Enhancements:

Refactor the query classification logic to improve the accuracy of taxon and assembly identification in user queries.
Enhance the chat UI with a responsive design, making it fully functional on both desktop and mobile devices.
Improve the chat message handling by adding a debug element for better error tracking and resolution.

Build:

Update the build configuration to support the new UI components and ensure compatibility with the latest dependencies.

Documentation:

Update the installation guide to include steps for setting up the new UI and running tests.

Tests:

Add comprehensive tests for the query pipeline components to ensure the reliability and accuracy of the NLP functionalities.

Refined logic for time based queries

…entQP

…rection

…bug_output_html

Improved test cases to suit the new pipeline

sourcery-ai · 2024-09-18T12:50:39Z

Reviewer's Guide by Sourcery

This pull request introduces a new GoaT NLP UI with seamless streaming between frontend and servers. The changes include updates to the backend Python code for handling queries and processing data, as well as the addition of a new Next.js-based frontend UI with various components for chat functionality, model selection, and user interactions.

File-Level Changes

Change	Details	Files
Updated backend query processing and data handling	Modified prompt templates for query classification and entity identification Added new functions for attribute identification and condition definition Updated the query pipeline to include new steps and improve data flow	`src/prompt.py` `src/agent/component_helpers.py` `src/agent/query_pipeline.py`
Implemented new Next.js-based frontend UI	Created chat components for message display and input Added user settings and model selection functionality Implemented dark mode and theme switching Created API routes for chat and model management	`ui/src/app/page.tsx` `ui/src/components/chat/chat-layout.tsx` `ui/src/components/chat/chat-list.tsx` `ui/src/components/chat/chat-bottombar.tsx` `ui/src/components/user-settings.tsx` `ui/src/app/api/chat/route.ts` `ui/src/app/api/model/route.ts`
Enhanced streaming capabilities between frontend and backend	Implemented server-sent events for real-time updates Added support for speech-to-text input in the chat interface Created a streaming response handler in the backend	`src/app.py` `ui/src/app/page.tsx` `ui/src/app/hooks/useSpeechRecognition.ts`
Added UI components and utilities	Created reusable UI components like buttons, dialogs, and dropdowns Implemented utility functions for theming and styling Added emoji picker and code syntax highlighting features	`ui/src/components/ui/button.tsx` `ui/src/components/ui/dialog.tsx` `ui/src/components/ui/dropdown-menu.tsx` `ui/src/lib/utils.ts` `ui/src/components/code-display-block.tsx` `ui/src/components/emoji-picker.tsx`

Tips

Trigger a new Sourcery review by commenting @sourcery-ai review on the pull request.
Continue your discussion with Sourcery by replying directly to review comments.
You can change your review settings at any time by accessing your dashboard:
- Enable or disable the Sourcery-generated pull request summary or reviewer's guide;
- Change the review language;
You can always contact us if you have any questions or feedback.

sourcery-ai

Hey @deepnayak - I've reviewed your changes - here's some feedback:

Overall Comments:

This PR adds a new GoaT NLP UI with streaming functionality between the frontend and servers. It includes significant updates to the chat interface, new components for handling user input and displaying responses, and integration with the GoaT API.
The changes introduce a more robust error handling system and improved state management for the chat application. However, it would be beneficial to add more comprehensive documentation for the new components and functions to aid future maintenance and development.

Here's what I looked at during the review

🟡 General issues: 5 issues found
🟡 Security: 1 issue found
🟢 Testing: all looks good
🟡 Complexity: 4 issues found
🟡 Documentation: 4 issues found

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment to tell me if it was helpful.}

sourcery-ai · 2024-09-18T12:52:20Z

src/app.py

@@ -29,6 +34,7 @@
 LlamaIndexInstrumentor().instrument(tracer_provider=tracer_provider)

 app = Flask("goat_nlp")
+CORS(app)


🚨 suggestion (security): Specify allowed origins for CORS

For better security, consider specifying allowed origins rather than allowing all origins. This helps prevent unauthorized access from potentially malicious sources.

Suggested change

CORS(app)

CORS(app, resources={r"/*": {"origins": ["https://yourdomain.com", "http://localhost:3000"]}})

sourcery-ai · 2024-09-18T12:52:20Z

src/agent/component_helpers.py

@@ -27,6 +28,7 @@
 def identify_index(input: str, state: Dict[str, Any]):
    index_response = Settings.llm.complete(INDEX_PROMPT.format(query=input)).text
    state["index"] = json.loads(extract_json_str(index_response))
+    state["status"] = "Identify Index"


suggestion: Consider using an enum for status values

Using an enum for status values would provide better type safety and prevent potential typos in status strings. It would also make it easier to manage and update the list of possible statuses.

from enum import Enum class Status(Enum): IDENTIFY_INDEX = "Identify Index" state["status"] = Status.IDENTIFY_INDEX

sourcery-ai · 2024-09-18T12:52:20Z

ui/src/components/chat/chat.tsx

+import { ChatRequestOptions } from "ai";
+import { v4 as uuidv4 } from "uuid";
+
+export interface ChatProps {


suggestion: Consider grouping related props to improve maintainability

The ChatProps interface has many properties. Consider grouping related props into sub-objects (e.g., messageProps, inputProps) to improve readability and maintainability. This could make the component easier to use and understand.

export interface ChatProps { messageProps: { chatId?: string; // other message-related props }; inputProps: { setSelectedModel: React.Dispatch<React.SetStateAction<string>>; // other input-related props }; }

sourcery-ai · 2024-09-18T12:52:20Z

ui/tailwind.config.ts

+    './src/**/*.{ts,tsx}',
+	],
+  prefix: "",
+  theme: {


suggestion: Optimize color definitions using Tailwind's opacity modifiers

There's repetition in color definitions. Consider using Tailwind's color opacity modifiers (e.g., 'primary/80' for 80% opacity) to reduce repetition and make the config more maintainable.

theme: { extend: { colors: { primary: { DEFAULT: '#3490dc', '80': 'rgba(52, 144, 220, 0.8)', '60': 'rgba(52, 144, 220, 0.6)', '40': 'rgba(52, 144, 220, 0.4)', '20': 'rgba(52, 144, 220, 0.2)', }, }, }, container: {

sourcery-ai · 2024-09-18T12:52:20Z

ui/src/components/code-display-block.tsx

+
+  return (
+    <div className="relative my-4 overflow-scroll overflow-x-scroll  flex flex-col   text-start  ">
+      <Button


suggestion: Enhance accessibility of the copy button

Add an aria-label to the copy button to improve accessibility. For example: aria-label="Copy code to clipboard". Also, consider using a more robust solution for managing the copied state, such as a custom hook, instead of setTimeout.

<Button onClick={copyToClipboard} variant="ghost" aria-label="Copy code to clipboard"

sourcery-ai · 2024-09-18T12:52:23Z

src/agent/test_query_pipeline.py

+    for entity in entities:
+        assert (
+            (entity["scientific_name"].lower() in [x.lower() for x in expected_entities])
+            or (entity["singular_form"].lower() in [x.lower() for x in expected_entities])
+            or (entity["plural_form"].lower() in [x.lower() for x in expected_entities])
+        )


issue (code-quality): Avoid loops in tests. (no-loop-in-tests)

Explanation
Avoid complex code, like loops, in test functions.
Google's software engineering guidelines says:
"Clear tests are trivially correct upon inspection"
To reach that avoid complex code in tests:

loops

conditionals

Some ways to fix this:

Use parametrized tests to get rid of the loop.

Move the complex logic into helpers.

Move the complex part into pytest fixtures.

Complexity is most often introduced in the form of logic. Logic is defined via the imperative parts of programming languages such as operators, loops, and conditionals. When a piece of code contains logic, you need to do a bit of mental computation to determine its result instead of just reading it off of the screen. It doesn't take much logic to make a test more difficult to reason about.

Software Engineering at Google / Don't Put Logic in Tests

sourcery-ai · 2024-09-18T12:52:23Z

src/agent/test_query_pipeline.py

+    if expected_rank is None:
+        pytest.skip("No expected rank for this test case")


issue (code-quality): Avoid conditionals in tests. (no-conditionals-in-tests)

Explanation
Avoid complex code, like conditionals, in test functions.
Google's software engineering guidelines says:
"Clear tests are trivially correct upon inspection"
To reach that avoid complex code in tests:

loops

conditionals

Some ways to fix this:

Use parametrized tests to get rid of the loop.

Move the complex logic into helpers.

Move the complex part into pytest fixtures.

Complexity is most often introduced in the form of logic. Logic is defined via the imperative parts of programming languages such as operators, loops, and conditionals. When a piece of code contains logic, you need to do a bit of mental computation to determine its result instead of just reading it off of the screen. It doesn't take much logic to make a test more difficult to reason about.

Software Engineering at Google / Don't Put Logic in Tests

sourcery-ai · 2024-09-18T12:52:23Z

src/agent/test_query_pipeline.py

+    if expected_attribute is None:
+        pytest.skip("No expected attribute for this test case")


issue (code-quality): Avoid conditionals in tests. (no-conditionals-in-tests)

Explanation
Avoid complex code, like conditionals, in test functions.
Google's software engineering guidelines says:
"Clear tests are trivially correct upon inspection"
To reach that avoid complex code in tests:

loops

conditionals

Some ways to fix this:

Use parametrized tests to get rid of the loop.

Move the complex logic into helpers.

Move the complex part into pytest fixtures.

Complexity is most often introduced in the form of logic. Logic is defined via the imperative parts of programming languages such as operators, loops, and conditionals. When a piece of code contains logic, you need to do a bit of mental computation to determine its result instead of just reading it off of the screen. It doesn't take much logic to make a test more difficult to reason about.

Software Engineering at Google / Don't Put Logic in Tests

sourcery-ai · 2024-09-18T12:52:23Z

src/agent/test_query_pipeline.py

+    if expected_time_from is None and expected_time_to is None:
+        pytest.skip("No expected time for this test case")


issue (code-quality): Avoid conditionals in tests. (no-conditionals-in-tests)

Explanation
Avoid complex code, like conditionals, in test functions.
Google's software engineering guidelines says:
"Clear tests are trivially correct upon inspection"
To reach that avoid complex code in tests:

loops

conditionals

Some ways to fix this:

Use parametrized tests to get rid of the loop.

Move the complex logic into helpers.

Move the complex part into pytest fixtures.

Complexity is most often introduced in the form of logic. Logic is defined via the imperative parts of programming languages such as operators, loops, and conditionals. When a piece of code contains logic, you need to do a bit of mental computation to determine its result instead of just reading it off of the screen. It doesn't take much logic to make a test more difficult to reason about.

Software Engineering at Google / Don't Put Logic in Tests

sourcery-ai · 2024-09-18T12:52:23Z

src/agent/test_query_pipeline.py

+    if expected_attribute is None or expected_attribute_condition is None:
+        pytest.skip("No expected attribute for this test case")


issue (code-quality): Avoid conditionals in tests. (no-conditionals-in-tests)

Explanation
Avoid complex code, like conditionals, in test functions.
Google's software engineering guidelines says:
"Clear tests are trivially correct upon inspection"
To reach that avoid complex code in tests:

loops

conditionals

Some ways to fix this:

Use parametrized tests to get rid of the loop.

Move the complex logic into helpers.

Move the complex part into pytest fixtures.

Complexity is most often introduced in the form of logic. Logic is defined via the imperative parts of programming languages such as operators, loops, and conditionals. When a piece of code contains logic, you need to do a bit of mental computation to determine its result instead of just reading it off of the screen. It doesn't take much logic to make a test more difficult to reason about.

Software Engineering at Google / Don't Put Logic in Tests

…ructions

rjchallis

Looking good - a couple of thoughts:

It would be great to have a bit of text with the GoaT link - even with the larger font it is still quite hard to spot. An alternative would be to return the link after the explanation so it stays closer to the text box that the user will be focussed on.
the models in the ui portion, it seems to need llama2 - is there a way to have this use llama3.1
Running this locally I'm having trouble asking questions about the returned result - have you pushed that feature yet?

INSTALL.md

src/app.py

ui/ollama-nextjs-ui.gif

ui/public/user.jpg

ui/src/utils/initial-questions.ts

deepnayak · 2024-09-19T16:45:42Z

Looking good - a couple of thoughts:

It would be great to have a bit of text with the GoaT link - even with the larger font it is still quite hard to spot. An alternative would be to return the link after the explanation so it stays closer to the text box that the user will be focussed on.

the models in the ui portion, it seems to need llama2 - is there a way to have this use llama3.1

Running this locally I'm having trouble asking questions about the returned result - have you pushed that feature yet?

Makes sense, I will push this fix
This actually depends on the model downloaded locally. Basically the UI application makes a GET request to /api/tags which is an OLLAMA endpoint that returns the locally available models.
I have already pushed the change, but I had modified the API response JSON to remove unwanted content, maybe it is cutting out important content. Can you please tell me which queries you were facing issues with? I can try to recreate it locally and check

src/agent/test_query_pipeline.py

rjchallis · 2024-09-20T09:28:57Z

I have already pushed the change, but I had modified the API response JSON to remove unwanted content, maybe it is cutting out important content. Can you please tell me which queries you were facing issues with? I can try to recreate it locally and check

Querying the JSON context is working well now I have the models sorted out

deepnayak and others added 30 commits June 18, 2024 00:53

Added code for JSON oriented model approach

3abfcdd

chore: Update code formatting and editor settings

43c77e6

Refined logic for time based queries

refactor: Reorganize imports and update code formatting

b0ddec5

chore: Refactor build_index function to simplify code

2727fc8

Minor changes

c770b71

Minor changes

cd314da

Removed conflicting flake8 error

80e24dc

Updated model in INSTALL.md

a7f98f7

Resolved PR Comments

9a06fa5

Added query pipeline feature and agent based approach

a4d1502

Merge branch 'main' of https://github.com/genomehubs/goat-nlp into ag…

d67c1f7

…entQP

Minor improvements

42b0542

refactor: Refactor validators.py for improved JSON validation and cor…

8cdd7e4

…rection

Approach using only query pipeline

1d29fc1

Improved query pipeline

d9faf5c

Improved query pipeline

88e5736

Fixed sourcery issues

91186af

Minor changes

32aed93

Improved rank and entity selection

32dae8d

Minor changes

fe131e8

Quick Fix for record queries

c55ec68

add test framework

ffef277

add extra test case

dd95ae1

Added debug json to html

199cfb6

Styling changes

fa79d73

Merge branch 'main' of https://github.com/genomehubs/goat-nlp into de…

f93d997

…bug_output_html

Minor changes

dbab17c

Minor changes

a979244

Minor performance improvement

300c7bd

Improved query pipeline

7761c52

deepnayak added 6 commits August 28, 2024 18:44

Pipeline improvements

c79a073

Performance improvement

32ad9ad

Improved test cases to suit the new pipeline

UI and some initial streaming implementation

e7fe5e0

Fixed streaming component and displayed status at every stage

f0eef0f

Minor improvements

b9fd9e1

Overall improvements to pipeline and UI

a80cf20

sourcery-ai bot reviewed Sep 18, 2024

View reviewed changes

deepnayak added 7 commits September 18, 2024 18:30

Fix flake8 issues

e08fe5f

Fix flake8 issues

10f88df

Fix flake8 issues

5451876

Fix sourcery issues

7cef6b6

Minor Changes

4cfbf1f

Added page context to the final result and added UI installation inst…

2cd93ed

…ructions

Added emphasis to GoaT links

41cb6fe

rjchallis reviewed Sep 19, 2024

View reviewed changes

INSTALL.md Show resolved Hide resolved

src/app.py Outdated Show resolved Hide resolved

ui/ollama-nextjs-ui.gif Outdated Show resolved Hide resolved

ui/public/user.jpg Outdated Show resolved Hide resolved

ui/src/utils/initial-questions.ts Outdated Show resolved Hide resolved

Resolved PR comments

4b137f5

Minor Improvements

62e21dc

rjchallis reviewed Sep 20, 2024

View reviewed changes

src/agent/test_query_pipeline.py Outdated Show resolved Hide resolved

Fixed minor typos

72fce9a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds new GoaT NLP UI along with seamless streaming between frontend and servers #33

Adds new GoaT NLP UI along with seamless streaming between frontend and servers #33

deepnayak commented Sep 18, 2024 •

edited by sourcery-ai bot

Loading

sourcery-ai bot commented Sep 18, 2024 •

edited

Loading

sourcery-ai bot left a comment

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

sourcery-ai bot Sep 18, 2024

rjchallis left a comment

deepnayak commented Sep 19, 2024

rjchallis commented Sep 20, 2024

	CORS(app)
	CORS(app, resources={r"/*": {"origins": ["https://yourdomain.com", "http://localhost:3000"]}})

		if expected_rank is None:
		pytest.skip("No expected rank for this test case")

		if expected_attribute is None:
		pytest.skip("No expected attribute for this test case")

		if expected_time_from is None and expected_time_to is None:
		pytest.skip("No expected time for this test case")

		if expected_attribute is None or expected_attribute_condition is None:
		pytest.skip("No expected attribute for this test case")

Adds new GoaT NLP UI along with seamless streaming between frontend and servers #33

Are you sure you want to change the base?

Adds new GoaT NLP UI along with seamless streaming between frontend and servers #33

Conversation

deepnayak commented Sep 18, 2024 • edited by sourcery-ai bot Loading

Summary by Sourcery

sourcery-ai bot commented Sep 18, 2024 • edited Loading

Reviewer's Guide by Sourcery

File-Level Changes

sourcery-ai bot left a comment

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

sourcery-ai bot Sep 18, 2024

Choose a reason for hiding this comment

rjchallis left a comment

Choose a reason for hiding this comment

deepnayak commented Sep 19, 2024

rjchallis commented Sep 20, 2024

deepnayak commented Sep 18, 2024 •

edited by sourcery-ai bot

Loading

sourcery-ai bot commented Sep 18, 2024 •

edited

Loading