Add sorting for aggregating data for goal state shifts 50e921d Christina Theodoris commited on Jul 19, 2023
Fix min_genes to be >= tokens to perturb as a group 268e566 Christina Theodoris commited on Jul 19, 2023
Update tokenizer to allow tokenization without custom cell attributes 57b9778 Christina Theodoris commited on Jul 16, 2023
Update isp to allow modeling single perturbation in multiple cells as batches acd253c Christina Theodoris commited on Jul 16, 2023
Change list of individual IDs to set to ensure unique before subsetting into train/valid/test sets 45b9d69 Christina Theodoris commited on Jul 13, 2023
Update internal format of anchor token to list for consistency with genes to perturb b36d210 Christina Theodoris commited on Jul 10, 2023
Add filtering for start state cells prior to in silico perturbation when modeling cell states bb217cf Christina Theodoris commited on Jul 9, 2023
Correct order of state dict in in silico perturber stats and tensor dims of alt state emb in in silico perturber 3d06203 Christina Theodoris commited on Jul 8, 2023
Add explanation of output columns and sort by largest shift 3072225 Christina Theodoris commited on Jul 8, 2023
Add note to recommend tuning hyperparameters for downstream applications 98ce6d7 Christina Theodoris commited on Jul 7, 2023
Add instructions to convert gene annotations w/ Ensembl Biomart 77eb432 Christina Theodoris commited on Jul 6, 2023
Modify tokenizer to allow renaming attr names btwn loom and .dataset e78c44d Christina Theodoris commited on Jul 3, 2023
Modify quant_layers to convert layer nums to integers before calculating max 2181aa4 Christina Theodoris commited on Jul 3, 2023
Remove unused evalset variable from cell classification example 879a878 Christina Theodoris commited on Jul 1, 2023
Modify documentation for modeling only 2 cell states 019165f Christina Theodoris commited on Jun 27, 2023
Add instructions for modeling only 2 states and modify stats script for that option 912860d Christina Theodoris commited on Jun 27, 2023
Update links including unsorted example lengths file f0b6641 Christina Theodoris commited on Jun 25, 2023
Add function to create remainder emb for in silico overexpression batch feeecd0 Christina Theodoris commited on Jun 23, 2023
Add mixture model option for gene-gene interaction stats dc1481d Christina Theodoris commited on Jun 22, 2023
Add stats with mixture model to determine whether test perturbation is in impact component d20ad0a Christina Theodoris commited on Jun 20, 2023
Add example code for obtaining non-zero median digests 67b7f01 Christina Theodoris commited on Jun 16, 2023
Fix filter_data to allow value of None for no filtering 5fcf2b8 Christina Theodoris commited on Jun 15, 2023
Add further explanation regarding input file format for transcriptome tokenizer c34ead6 Christina Theodoris commited on Jun 12, 2023
Update instructions in tokenizing example to clarify requirement for Ensembl ID d468697 Christina Theodoris commited on Jun 11, 2023
Update instructions in tokenizing example to clarify requirement for Ensembl ID e606e1c Christina Theodoris commited on Jun 11, 2023
Add further explanation to tokenizer example script and updated tokenizer to match loompy raised error 78dd83b Christina Theodoris commited on Jun 10, 2023
Move example input files to dataset repository to include example datasets for fine-tuning 875ef33 Christina Theodoris commited on Jun 9, 2023
Update gene classification example to create directory after training arguments are defined ebe5ee8 Christina Theodoris commited on Jun 9, 2023
Fixes to stats and adding gene dict attempt number 2 (#13) 42e9bf9 ctheodoris commited on Jun 7, 2023
Moving merged in_silico_perturber_stats.py to geneformer folder 9f2c6cc ctheodoris commited on Jun 6, 2023
Added comparison to null distribution for stats (#9) 7d74c82 ctheodoris davidjwen commited on Jun 6, 2023
Add example input labels for distinguishing bivalent promoters. 996d3e5 Christina Theodoris commited on Jun 6, 2023