feat(bigframes): Support loading avro, orc data#16555
feat(bigframes): Support loading avro, orc data#16555TrevorBergeron wants to merge 4 commits intomainfrom
Conversation
There was a problem hiding this comment.
Code Review
This pull request introduces support for reading ORC and Avro files into BigQuery DataFrames by implementing read_orc and read_avro methods in the Session class and providing corresponding API wrappers. Review feedback identifies a bug in the system tests where to_orc is called on a BigFrames DataFrame instead of a pandas DataFrame. Additionally, several improvements are suggested to maintain alphabetical order in imports and function definitions, along with a minor wording update for an error message to improve clarity.
| The write engine used to persist the data to BigQuery if needed. | ||
|
|
||
| Returns: | ||
| bigframes.dataframe.DataFrame: |
There was a problem hiding this comment.
Nit: in docs, use bigframes.pandas.DataFrame so that we link here: https://dataframes.bigquery.dev/reference/api/bigframes.pandas.DataFrame.html#bigframes.pandas.DataFrame
This is less of a concern now that we've migrated off of Cloud RAD onto plain sphinx, which does dedupe aliases, AFAIK, but I'd like to ensure we keep consistency.
| The engine used to read the file. Only `bigquery` is supported for Avro. | ||
|
|
||
| Returns: | ||
| bigframes.dataframe.DataFrame: |
There was a problem hiding this comment.
Same here: bigframes.pandas.DataFrame in docs.
| bigframes.dataframe.DataFrame: | ||
| A new DataFrame representing the data from the Avro file. | ||
| """ | ||
| if engine not in ("auto", "bigquery"): |
There was a problem hiding this comment.
No action required, but I did see https://arrow.apache.org/blog/2025/10/23/introducing-arrow-avro/ last year, which should be enough to unlock a potential upstream contribution for a read_avro method in pandas.
Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:
Fixes #<issue_number_goes_here> 🦕