mirror of
				https://github.com/labmlai/annotated_deep_learning_paper_implementations.git
				synced 2025-10-31 02:39:16 +08:00 
			
		
		
		
	
		
			
				
	
	
	
		
			3.3 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	
			3.3 KiB
		
	
	
	
	
	
	
	
GPT-NeoX
This is a simple implementation of Eleuther GPT-NeoX for inference and fine-tuning.
Samples
Evaluation
Evaluation Results
| Task | Metric | NeoX Impl (2 GPU) | This repo (1 GPU) | LLM.Int8 | 
|---|---|---|---|---|
| anli_r1 | acc | 0.3270 | 0.3360 | 0.3440 | 
| acc_stderr | 0.0148 | 0.0149 | 0.0150 | |
| anli_r2 | acc | 0.3410 | 0.3350 | 0.3540 | 
| acc_stderr | 0.0150 | 0.0149 | 0.0151 | |
| anli_r3 | acc | 0.3567 | 0.3525 | 0.3567 | 
| acc_stderr | 0.0138 | 0.0149 | 0.0138 | |
| hellaswag | acc | 0.5351 | 0.5353 | 0.5348 | 
| acc_stderr | 0.0050 | 0.0050 | 0.0050 | |
| acc_norm | 0.7140 | 0.7145 | 0.7132 | |
| acc_norm_stderr | 0.0045 | 0.0045 | 0.0045 | |
| lambada | acc | 0.7211 | 0.7204 | 0.7155 | 
| acc_stderr | 0.0062 | 0.0063 | 0.0063 | |
| ppl | 3.6760 | 3.6375 | 3.7245 | |
| ppl_stderr | 0.0760 | 0.0747 | 0.0768 | |
| piqa | acc | 0.7748 | 0.7758 | 0.7769 | 
| acc_stderr | 0.0097 | 0.0097 | 0.0097 | |
| acc_norm | 0.7786 | 0.7845 | 0.7829 | |
| acc_norm_stderr | 0.0097 | 0.0096 | 0.0096 | |
| winogrande | acc | 0.6598 | 0.6582 | 0.6606 | 
| acc_stderr | 0.0133 | 0.0133 | 0.0133 | |
| wsc | acc | 0.5096 | 0.5000 | 0.5288 | 
| acc_stderr | 0.0493 | 0.0493 | 0.0492 | |
| mathqa | acc | 0.2720 | ||
| acc_stderr | 0.0081 | |||
| acc_norm | 0.2727 | |||
| acc_norm_stderr | 0.0082 | 
Official Eleuther GPT-NoeX is source code is available at eleutherai/gpt-neox.
