Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(Pipelines) synthetic dataframe pipeline (Sourcery refactored) #703

Merged

Conversation

sourcery-ai[bot]
Copy link
Contributor

@sourcery-ai sourcery-ai bot commented Oct 30, 2023

Pull Request #688 refactored by Sourcery.

If you're happy with these changes, merge this Pull Request using the Squash and merge strategy.

NOTE: As code is pushed to the original Pull Request, Sourcery will
re-run and update (force-push) this Pull Request with new refactorings as
necessary. If Sourcery finds no refactorings at any point, this Pull Request
will be closed automatically.

See our documentation here.

Run Sourcery locally

Reduce the feedback loop during development by using the Sourcery editor plugin:

Review changes via command line

To manually merge these changes, make sure you're on the feat/pipelines_synthetic_df branch, then run:

git fetch origin sourcery/feat/pipelines_synthetic_df
git merge --ff-only FETCH_HEAD
git reset HEAD^

Help us improve this pull request!

@sourcery-ai sourcery-ai bot requested a review from ArslanSaleem October 30, 2023 06:16
data = namespace["df"]

return data

return namespace["df"]
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Function SDFCodeExecutor.execute refactored with the following changes:

Comment on lines -28 to -32
prompt = GenerateSyntheticDfPrompt(
amount=self._amount, dataframe=context.dfs[kwargs.get("dataframe_index", 0)]
return GenerateSyntheticDfPrompt(
amount=self._amount,
dataframe=context.dfs[kwargs.get("dataframe_index", 0)],
)

return prompt
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Function SyntheticDataframePrompt.execute refactored with the following changes:

@coderabbitai
Copy link
Contributor

coderabbitai bot commented Oct 30, 2023

Important

Auto Review Skipped

Bot user detected.

To trigger a single review, invoke the @coderabbitai review command.


Tips

Chat with CodeRabbit Bot (@coderabbitai)

  • If you reply to a review comment from CodeRabbit, the bot will automatically respond.
  • To engage with CodeRabbit bot directly around the specific lines of code in the PR, mention @coderabbitai in your review comment
  • Note: Review comments are made on code diffs or files, not on the PR overview.
  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.

CodeRabbit Commands (invoked as PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger a review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai help to get help.
  • @coderabbitai resolve to resolve all the CodeRabbit review comments.

Note: For conversation with the bot, please use the review comments on code diffs or files.

CodeRabbit Configration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • The JSON schema for the configuration file is available here.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/coderabbit-overrides.json

@codecov-commenter
Copy link

codecov-commenter commented Oct 30, 2023

Codecov Report

❗ No coverage uploaded for pull request base (feat/pipelines_synthetic_df@b92d20e). Click here to learn what that means.
The diff coverage is n/a.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

@@                      Coverage Diff                       @@
##             feat/pipelines_synthetic_df     #703   +/-   ##
==============================================================
  Coverage                               ?   84.49%           
==============================================================
  Files                                  ?       81           
  Lines                                  ?     3709           
  Branches                               ?        0           
==============================================================
  Hits                                   ?     3134           
  Misses                                 ?      575           
  Partials                               ?        0           

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@gventuri gventuri merged commit a66eb3d into feat/pipelines_synthetic_df Oct 30, 2023
9 checks passed
gventuri added a commit that referenced this pull request Nov 1, 2023
* feat(pipeline): Add pipeline to generate synthetic dataframe

* chore(pipeline): maintain documentation and other flows

* feat(Pipeline): test case for pipeline

* feat(cache): adding cache in pipeline context and fix leftovers

* chore(pipeline): rename and add dependency

* update poetry lock file

* refactor: minor changes from the code review

Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>

* chore: update pipeline_context.py

* chore: use PandasAI logger instead of default one

* refactor: prompt for synthetic data now accepts the amount params

* remove extra print statement

* 'Refactored by Sourcery' (#703)

Co-authored-by: Sourcery AI <>

* chore(pipeline): improve pipeline usage remove passing config to pipeline

* feat: config plot libraries (#705)

* In this commit, I introduced a new configuration parameter in our application settings that allows users to define their preferred data visualization library (matplotlib, seaborn, or plotly).
With this update, I've eliminated the need for the user to specify in every prompt which library to use, thereby simplifying their interaction with the application and increasing its versatility.

* This commit adds a configuration parameter for users to set their preferred data visualization library (matplotlib, seaborn, or plotly), simplifying interactions and enhancing the application's versatility.

* viz_library_type' in test_generate_python_code_prompt.py, resolved failing tests

---------

Co-authored-by: sabatino.severino <qrxqfspfibrth6nxywai2qifza6jmskt222howzew43risnx4kva>
Co-authored-by: Gabriele Venturi <[email protected]>

* build: use ruff for formatting

* feat: add add_message method to the agent

* Release v1.4.3

* feat: workspace env (#717)

* fix(chart): charts to save to save_chart_path

* refactor sourcery changes

* 'Refactored by Sourcery'

* refactor chart save code

* fix: minor leftovers

* feat(workspace_env): add workspace env to store cache, temp chart and config

* add error handling and comments

---------

Co-authored-by: Sourcery AI <>

---------

Co-authored-by: Gabriele Venturi <[email protected]>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Co-authored-by: sourcery-ai[bot] <58596630+sourcery-ai[bot]@users.noreply.github.com>
Co-authored-by: Sab Severino <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants