Commit History
add input size tip to instructions
2732369
verified
ctheodoris
commited on
update instructions to include reminder about token dictionary
cb1b0d5
verified
ctheodoris
commited on
precommit formatting
f07bfd7
ctheodoris
commited on
add option for hyperparameter tuning to cc.validate
4bddd45
Christina Theodoris
commited on
update examples for predict_eval and handle roc for 2 cell classes
eeba323
Christina Theodoris
commited on
Add classifier module and examples
9e9cca9
Christina Theodoris
commited on
Add functions for extracting gene embeddings, move state_embs_dict outside isp, fix bugs in isp
2f25aea
Christina Theodoris
commited on
Add option to output embs as tensor
624349c
Christina Theodoris
commited on
anndata_tokenizer (#170)
4302f48
Config head node runtime via IP (#234)
b8fda63
Fixed error with perturbing individual genes and updated ways to specify cell_states_to_model (#146)
9169bfd
Rename heatmap legend to be correct label
badcca6
Christina Theodoris
commited on
Add function to extract and plot cell embeddings
d154fee
Christina Theodoris
commited on
Update isp to allow modeling single perturbation in multiple cells as batches
acd253c
Christina Theodoris
commited on
Change list of individual IDs to set to ensure unique before subsetting into train/valid/test sets
45b9d69
Christina Theodoris
commited on
Add note to recommend tuning hyperparameters for downstream applications
98ce6d7
Christina Theodoris
commited on
Add instructions to convert gene annotations w/ Ensembl Biomart
77eb432
Christina Theodoris
commited on
Remove unused evalset variable from cell classification example
879a878
Christina Theodoris
commited on
Update links including unsorted example lengths file
f0b6641
Christina Theodoris
commited on
Add uniform max len for padding for predictions
67f674c
Christina Theodoris
commited on
Add example code for obtaining non-zero median digests
67b7f01
Christina Theodoris
commited on
Add further explanation regarding input file format for transcriptome tokenizer
c34ead6
Christina Theodoris
commited on
Update instructions in tokenizing example to clarify requirement for Ensembl ID
d468697
Christina Theodoris
commited on
Update instructions in tokenizing example to clarify requirement for Ensembl ID
e606e1c
Christina Theodoris
commited on
Add further explanation to tokenizer example script and updated tokenizer to match loompy raised error
78dd83b
Christina Theodoris
commited on
Move example input files to dataset repository to include example datasets for fine-tuning
875ef33
Christina Theodoris
commited on
Add example for tokenizing .loom dataset
f89f796
Christina Theodoris
commited on
Update gene classification example to create directory after training arguments are defined
ebe5ee8
Christina Theodoris
commited on
Subclass collator for cell classification
402ba9b
Christina Theodoris
commited on
Update pretrainer for transformers==4.28.0
b925dcc
Christina Theodoris
commited on
Add example input labels for distinguishing bivalent promoters.
996d3e5
Christina Theodoris
commited on
Add in silico perturbation example notebook.
1c2f864
Christina Theodoris
commited on
Add example input files for gene classification of TF dosage sensitivity
0710c44
Christina Theodoris
commited on
Add example for hyperparameter optimization for disease classifier
79a0c41
Christina Theodoris
commited on
Add data collator for gene classification and example for gene classification
edacd12
Christina Theodoris
commited on
Add data collator for cell classification and example for cell classification
088ea6e
Christina Theodoris
commited on
Add Geneformer trainer and pretraining example
bcc03e8
Christina Theodoris
commited on