[gguf] Add types #562

mishig25 · 2024-03-19T16:59:15Z

GGUF add types. Follow up to #540 (comment).

No any kind of validation, just types

cc: @biw also

mishig25 · 2024-03-19T17:04:05Z

packages/gguf/src/gguf.ts

+export type { MetadataValue, Version, GGUFMetadata, GGUFTensorInfo, GGUFParseOutput } from "./types";
+export { GGUFValueType, GGMLQuantizationType } from "./types";


is it a correct way to re-export ?

julien-c · 2024-03-20T12:02:50Z

packages/gguf/src/types.ts

+export enum GGMLQuantizationType {
+ F32 = 0,
+ F16 = 1,
+ Q4_0 = 2,
+ Q4_1 = 3,
+ Q5_0 = 6,
+ Q5_1 = 7,
+ Q8_0 = 8,
+ Q8_1 = 9,
+ Q2_K = 10,
+ Q3_K = 11,
+ Q4_K = 12,
+ Q5_K = 13,
+ Q6_K = 14,
+ Q8_K = 15,
+ IQ2_XXS = 16,
+ IQ2_XS = 17,
+ IQ3_XXS = 18,
+ IQ1_S = 19,
+ IQ4_NL = 20,
+ IQ3_S = 21,
+ IQ2_S = 22,
+ IQ4_XS = 23,
+}


enums are not strictly speaking types as they expose objects in the runtime.

End of not super useful pedantic note haha. cc @coyotte508

julien-c · 2024-03-20T12:14:48Z

packages/gguf/src/types.ts

+export type RWKV = ModelBase<"rwkv"> & { "rwkv.architecture_version": number };
+export type LLM = TransformerLLM | RWKV;
+export type Whisper = ModelBase<"encoder.whisper"> & ModelBase<"decoder.whisper">;
+export type Model = (LLM | Whisper) & Partial<Tokenizer>;


very neat types (though they make my head hurt a bit, lol)

julien-c · 2024-03-20T12:16:25Z

packages/gguf/src/types.ts

+ "llama",
+ "mpt",
+ "gptneox",
+ "gptj",
+ "gpt2",
+ "bloom",
+ "falcon",
+ "gemma",
+ "rwkv",
+ "whisper",


Suggested change

"llama",

"mpt",

"gptneox",

"gptj",

"gpt2",

"bloom",

"falcon",

"gemma",

"rwkv",

"whisper",

"llama",

"falcon",

"baichuan",

"gpt2",

"gptj",

"gptneox",

"mpt",

"starcoder",

"persimmon",

"refact",

"bert",

"nomic-bert",

"bloom",

"stablelm",

"qwen",

"qwen2",

"phi2",

"plamo",

"codeshell",

"orion",

"internlm2",

"minicpm",

"gemma",

"starcoder2",

"mamba",

(optional, but it's the current list from the llama.cpp source of truth IIUC)

julien-c

i would merge as is and iterate later

Follow up to #562

[gguf] Add types

b244568

mishig25 requested review from julien-c and coyotte508 March 19, 2024 17:00

mishig25 marked this pull request as ready for review March 19, 2024 17:01

mishig25 commented Mar 19, 2024

View reviewed changes

mishig25 added 2 commits March 20, 2024 11:51

stronger typing for ModelBase

a948e16

format

81e6ce1

julien-c reviewed Mar 20, 2024

View reviewed changes

julien-c approved these changes Mar 20, 2024

View reviewed changes

julien-c reviewed Mar 20, 2024

View reviewed changes

julien-c approved these changes Mar 20, 2024

View reviewed changes

mishig25 merged commit e745ba5 into main Mar 20, 2024
2 checks passed

mishig25 deleted the gguf_types branch March 20, 2024 12:32

This was referenced Mar 20, 2024

[gguf types] Add missing types & make existing types stronger #566

Closed

[gguf] Export MetadataBaseValue #571

Merged

mishig25 added a commit that referenced this pull request Mar 21, 2024

[gguf] Export MetadataBaseValue (#571)

32d403e

Follow up to #562

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[gguf] Add types #562

[gguf] Add types #562

mishig25 commented Mar 19, 2024 •

edited

mishig25 Mar 19, 2024

julien-c Mar 20, 2024

julien-c Mar 20, 2024

julien-c Mar 20, 2024

julien-c Mar 20, 2024

julien-c left a comment

		export type { MetadataValue, Version, GGUFMetadata, GGUFTensorInfo, GGUFParseOutput } from "./types";
		export { GGUFValueType, GGMLQuantizationType } from "./types";

- "llama",
- "mpt",
- "gptneox",
- "gptj",
- "gpt2",
- "bloom",
- "falcon",
- "gemma",
- "rwkv",
- "whisper",
+ "llama",
+ "falcon",
+ "baichuan",
+ "gpt2",
+ "gptj",
+ "gptneox",
+ "mpt",
+ "starcoder",
+ "persimmon",
+ "refact",
+ "bert",
+ "nomic-bert",
+ "bloom",
+ "stablelm",
+ "qwen",
+ "qwen2",
+ "phi2",
+ "plamo",
+ "codeshell",
+ "orion",
+ "internlm2",
+ "minicpm",
+ "gemma",
+ "starcoder2",
+ "mamba",

[gguf] Add types #562

[gguf] Add types #562

Conversation

mishig25 commented Mar 19, 2024 • edited

mishig25 Mar 19, 2024

Choose a reason for hiding this comment

julien-c Mar 20, 2024

Choose a reason for hiding this comment

julien-c Mar 20, 2024

Choose a reason for hiding this comment

julien-c Mar 20, 2024

Choose a reason for hiding this comment

julien-c Mar 20, 2024

Choose a reason for hiding this comment

julien-c left a comment

Choose a reason for hiding this comment

mishig25 commented Mar 19, 2024 •

edited