[ENH] publication_date parameter #221

Open
schmucr1 opened this issue Sep 6, 2024 · 0 comments
Labels
enhancement New feature or request


schmucr1 commented Sep 6, 2024

Hello!

Just a question that I'd like to ask here, as I cannot use Slack.

How long after publication (hours, days) do GEO studies become searchable through the pysradb Python API?

When I set the publication date range to start at today - 1 day, no results are found:

    from pysradb.search import GeoSearch  # import added; not shown in the original session

    # Search GEO for studies published between Sep 5 and Sep 6, 2024 (DD-MM-YYYY)
    instance = GeoSearch(publication_date="05-09-2024:06-09-2024", return_max=100, verbosity=3)
    instance.search()
    df = instance.get_df()
    print(df)
    print(df['study_alias'].unique())

    No results found for the following search query:
     SRA: {'query': 'sra gds[Filter]', 'accession': None, 'organism': None, 'layout': None, 'mbases': None, 'publication_date': '05-09-2024:06-09-2024', 'platform': None, 'selection': None, 'source': None, 'strategy': None, 'title': None}
    GEO DataSets: {'query': 'gds sra[Filter]', 'dataset_type': None, 'entry_type': None, 'publication_date': '05-09-2024:06-09-2024', 'organism': None}
    Empty DataFrame
    Columns: []
    Index: []
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "/home/app_user/.local/lib/python3.10/site-packages/pandas/core/frame.py", line 3761, in __getitem__
        indexer = self.columns.get_loc(key)
      File "/home/app_user/.local/lib/python3.10/site-packages/pandas/core/indexes/range.py", line 349, in get_loc
        raise KeyError(key)
    KeyError: 'study_alias'

Yet there are bulk RNA-seq studies published on Sep 5, e.g., GSE274586 or GSE262282.
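As a side note, the KeyError in the traceback comes from indexing an empty DataFrame, not from the search itself. A minimal sketch of a guard, assuming only pandas; `unique_study_aliases` is a hypothetical helper name and not part of pysradb:

```python
import pandas as pd


def unique_study_aliases(df: pd.DataFrame, column: str = "study_alias") -> list:
    """Return the unique values of `column`, or [] if the frame is empty
    or does not contain that column (hypothetical helper, not pysradb API)."""
    if df.empty or column not in df.columns:
        return []
    # dropna() avoids NaN entries; unique() keeps order of first appearance
    return df[column].dropna().unique().tolist()


# An empty frame, like the one printed when no results are found
print(unique_study_aliases(pd.DataFrame()))
```

With this, an empty search result yields an empty list instead of raising, so the "no results" case can be handled separately from real errors.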

A study published on Sep 4, GSE260817, is also not found when the publication_date range starts at today - 2 days:

    instance = GeoSearch(publication_date="04-09-2024:06-09-2024", return_max=1000, verbosity=3)
    instance.search()
    df = instance.get_df()
    print(df['study_alias'].unique())
    sum(df['study_alias'].unique() == 'GSE260817')

    100%|██████████████████████████████| 468/468 [00:23<00:00, 19.67it/s]
    ['GSE262126' 'GSE248977' 'GSE227333' 'GSE244392' 'GSE248058' 'GSE272252'
     'GSE248629' 'GSE227216' 'GSE233998' 'GSE275863' 'GSE261310' 'GSE240264'
     'GSE253407' 'GSE249182' 'GSE227338' 'GSE264212' 'GSE262125' 'GSE272635'
     'GSE276038' 'GSE271653' 'GSE276204' 'GSE267342' 'GSE232874' 'GSE267343'
     'GSE276065' 'GSE271667' 'GSE276245' 'GSE248627' 'GSE276037' 'GSE276206'
     'GSE248628']
    0
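For reference, the "today - N days" ranges above were typed by hand. A small sketch of how such a range string could be built, assuming the DD-MM-YYYY:DD-MM-YYYY format used in the calls above; `publication_date_range` is a hypothetical helper, not a pysradb function:

```python
from datetime import date, timedelta
from typing import Optional


def publication_date_range(days_back: int, end: Optional[date] = None) -> str:
    """Build a 'DD-MM-YYYY:DD-MM-YYYY' string covering the last `days_back`
    days up to `end` (defaults to today). Hypothetical helper for illustration."""
    end = end or date.today()
    start = end - timedelta(days=days_back)
    fmt = "%d-%m-%Y"  # zero-padded day and month, four-digit year
    return f"{start.strftime(fmt)}:{end.strftime(fmt)}"


# Reproduces the range from the second search (Sep 4 to Sep 6, 2024)
print(publication_date_range(2, end=date(2024, 9, 6)))  # → 04-09-2024:06-09-2024
```

The resulting string can be passed as the publication_date argument in the same way as the literal values above.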

What are the constraints?

Thank you and best regards,
Roland

schmucr1 added the enhancement label Sep 6, 2024