Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ctheodoris
/
Geneformer
like
199
Fill-Mask
Transformers
Safetensors
ctheodoris/Genecorpus-30M
bert
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
443
Train
Deploy
Use this model
eb2a04b
Geneformer
/
geneformer
14 contributors
History:
92 commits
ctheodoris
add random state to umap
eb2a04b
4 months ago
__init__.py
656 Bytes
fix cell state gene embeddings bug (#345)
5 months ago
classifier.py
60.7 kB
patch datasets save_to_disk
5 months ago
classifier_utils.py
16.7 kB
Update for gene classification (#330)
7 months ago
collator_for_classification.py
29.8 kB
Subclass collator for cell classification
over 1 year ago
emb_extractor.py
31.5 kB
add random state to umap
4 months ago
evaluation_utils.py
9.76 kB
update examples for predict_eval and handle roc for 2 cell classes
8 months ago
gene_median_dictionary.pkl
pickle
Detected Pickle imports (2)
"numpy.dtype"
,
"numpy.core.multiarray.scalar"
How to fix it?
941 kB
Add Geneformer tokenizer and updated model card
about 2 years ago
gene_name_id_dict.pkl
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.12 MB
LFS
Upload gene_name_id_dict.pkl (#14)
over 1 year ago
in_silico_perturber.py
39.2 kB
update get_embs with token_gene_dict arg
4 months ago
in_silico_perturber_stats.py
44.2 kB
update perturber stats to reflect cos sim and emb_extractor to suppress warnings for non-cls
4 months ago
perturber_utils.py
26.9 kB
Add function to get number of model embeddings (#364)
4 months ago
pretrainer.py
37.1 kB
update kwargs for pretrainer
6 months ago
token_dictionary.pkl
pickle
Detected Pickle imports (2)
"numpy.dtype"
,
"numpy.core.multiarray.scalar"
How to fix it?
788 kB
Update token_dictionary.pkl
over 1 year ago
tokenizer.py
16.5 kB
update tokenizer to include eos token
5 months ago