Commit History
remove token dictionary and unpickling from init (#403)
7eca269
verified
move dict loading to function in eval utils
57bc17e
ctheodoris
edit docs formatting
ef094b2
ctheodoris
update tokenizer to defaults for 95M models for special token and input size
da8cf3d
verified
ctheodoris
pointing dictionaries from the mtl module's init (#397)
7470753
verified
Refactored token dictionary loading and encapsulated dictionary (#398)
beb62a4
verified
Refactor: Convert mask_token_id, pad_token_id, and all_special_ids to properties (#395)
2e06f1a
verified
sync token_dictionary variable name w/ classifier
a021deb
verified
ctheodoris
fix imports mtl/eval_utils
eab1878
ctheodoris
allow model_type valid options to take params model_type : {"Pretrained", "GeneClassifier", "CellClassifier", "MTLCellClassifier", "MTLCellClassifier-Quantized"} (#390)
47e0ef8
verified
"save_model_without_heads" is redundant (#385)
de10ab0
verified
comment out "def save_model_without_heads(original_model_save_directory)"; redundant for ISP/Emb extractor (#382)
22bf20f
verified
fixed bug related to dynamic ranges in dictionary with 'min' and 'max' value mismatch in optuna suggest fn (#380)
fe1640b
verified
precommit formatting
f07bfd7
ctheodoris
update with 12L and 20L i4096 gc95M models, multitask and quantiz code
933ca80
ctheodoris
rename for consistency
ec19834
verified
ctheodoris
delete old gene name dict
817eca2
verified
ctheodoris
update to only have gene names as keys in gene_name_id_dict
e61485e
verified
ctheodoris
Add function for summing of Ensembl IDs (#377)
1e18102
verified
save pval
b07f4b1
ctheodoris
add typing list import
42053dc
ctheodoris
move dicts to init
ea428cb
ctheodoris
add random state to umap
eb2a04b
ctheodoris
update get_embs with token_gene_dict arg
ace12e9
verified
ctheodoris
update refs to get_model_emb_dims
3fe35ba
verified
ctheodoris
embs_df with all model embeddings (#363)
2e64874
verified
Add function to get number of model embeddings (#364)
c90d791
verified
clone embs_i to resolve memory leak in cls embs
57f02a4
ctheodoris
update perturber stats to reflect cos sim and emb_extractor to suppress warnings for non-cls
25dd1da
ctheodoris
update to account for set of perturbed genes with aggregate_data
eb038a6
ctheodoris
update to enable cls emb
b2bbd7c
ctheodoris
update tokenizer to include eos token
ead0550
Christina Theodoris
Update geneformer/emb_extractor.py (#350)
471eefc
verified
Upload in_silico_perturber_stats.py (#313)
8aee0ff
verified
fix cell state gene embeddings bug (#345)
c0e7b19
verified
ctheodoris
patch datasets save_to_disk
75c67a1
Christina Theodoris
update kwargs for pretrainer
fb130e6
Christina Theodoris
refer to token dictionary in self
86fe0dd
Christina Theodoris
Update for gene classification (#330)
94095d1
verified
Update with gene classifier, custom token dict, and str validate options (#329)
0568479
verified
add option for hyperparameter tuning to cc.validate
4bddd45
Christina Theodoris
correct typo
5a43832
verified
ctheodoris
update examples for predict_eval and handle roc for 2 cell classes
eeba323
Christina Theodoris
Update readthedocs for classifier
f75f5ac
Christina Theodoris
Get the gene keys and gene list keys from the token dictionary instead of medians (#304)
b294421
verified
Prevent ruff/isort on init
941390d
Christina Theodoris
Add classifier module and examples
9e9cca9
Christina Theodoris