d-matrix
/

gpt-j-6b

Text Generation

Inference Endpoints

Model card Files Files and versions Community

gpt-j-6b / README.md

d-matrix

Update README.md

67938d7 verified 9 months ago

|

1.07 kB

	---
	language:
	- en
	tags:
	- pytorch
	- causal-lm
	license: apache-2.0
	datasets:
	- EleutherAI/pile
	---
	This is a d-Matrix functional reference of the [EleutherAI/gpt-j-6b](https://huggingface.co/EleutherAI/gpt-j-6b) model.

	The reference provides the following functional configurations:
	Configuration \| Explanation
	:-- \| :--
	`BASELINE` \| a reference functionally equivalent to the original model
	`BASIC` \| all linear algebraic operands quantized to `BFP16-64`, and all other operations transformed to approximated kernel simulations

	### Usage

	Install d-Matrix [ML Tools](https://github.com/d-matrix-ai/dmx-mltools) first.

	```sh
	pip install dmx-mltools
	```

	The following is an example model and its evaluation.

	```python
	from mltools.dmx import pipeline

	pipe = pipeline(
	task="text-generation",
	model="d-matrix/gpt-j-6b",
	dmx_config="BASELINE", # see above for other variants
	)

	results = pipe.evaluate(
	metric="d-matrix/dmx_perplexity",
	dataset="wikitext",
	dataset_version="wikitext-2-raw-v1",
	)
	```

	### Evaluation results