KeruruDiary

2026/05/03(日)2026-05-03

foundryでLocalLLMする

Snapdragonとか言う変態的なCPUを積んだNotePCをメインPCにしてしまったのでLocalLLMを愉しむのも一苦労したのでみんなちゃんとMacbook買おうね。さてSnapdragon搭載のWindowsでは現在一番まともに使えそうなのはfoundryだ。

PS C:\windows\system32> foundry model list
Alias                          Device     Task           File Size    License      Model ID
-----------------------------------------------------------------------------------------------
qwen2.5-coder-0.5b             CPU        chat, tools    0.80 GB      apache-2.0   qwen2.5-coder-0.5b-instruct-generic-cpu:4
--------------------------------------------------------------------------------------------------------------------------------
phi-4-mini-reasoning           CPU        chat           4.52 GB      MIT          Phi-4-mini-reasoning-generic-cpu:3
-------------------------------------------------------------------------------------------------------------------------
qwen2.5-0.5b                   CPU        chat, tools    0.80 GB      apache-2.0   qwen2.5-0.5b-instruct-generic-cpu:4
--------------------------------------------------------------------------------------------------------------------------
qwen2.5-1.5b                   NPU        chat, tools    2.78 GB      MIT          qwen2.5-1.5b-instruct-qnn-npu:2
                               CPU        chat, tools    1.78 GB      apache-2.0   qwen2.5-1.5b-instruct-generic-cpu:4
--------------------------------------------------------------------------------------------------------------------------
qwen2.5-coder-1.5b             CPU        chat, tools    1.78 GB      apache-2.0   qwen2.5-coder-1.5b-instruct-generic-cpu:4
--------------------------------------------------------------------------------------------------------------------------------
phi-4-mini                     CPU        chat, tools    4.80 GB      MIT          Phi-4-mini-instruct-generic-cpu:5
------------------------------------------------------------------------------------------------------------------------
qwen2.5-14b                    CPU        chat, tools    11.06 GB     apache-2.0   qwen2.5-14b-instruct-generic-cpu:4
-------------------------------------------------------------------------------------------------------------------------
qwen2.5-coder-14b              CPU        chat, tools    11.06 GB     apache-2.0   qwen2.5-coder-14b-instruct-generic-cpu:4
-------------------------------------------------------------------------------------------------------------------------------
qwen2.5-coder-7b               CPU        chat, tools    6.16 GB      apache-2.0   qwen2.5-coder-7b-instruct-generic-cpu:4
------------------------------------------------------------------------------------------------------------------------------
qwen2.5-7b                     NPU        chat, tools    2.78 GB      MIT          qwen2.5-7b-instruct-qnn-npu:2
                               CPU        chat, tools    6.16 GB      apache-2.0   qwen2.5-7b-instruct-generic-cpu:4
------------------------------------------------------------------------------------------------------------------------
gpt-oss-20b                    CPU        chat           12.26 GB     MIT          gpt-oss-20b-generic-cpu:1
----------------------------------------------------------------------------------------------------------------
phi-3-mini-128k                NPU        chat           2.78 GB      MIT          phi-3-mini-128k-instruct-qnn-npu:3
                               CPU        chat           2.54 GB      MIT          Phi-3-mini-128k-instruct-generic-cpu:3
-----------------------------------------------------------------------------------------------------------------------------
phi-3.5-mini                   NPU        chat           2.78 GB      MIT          phi-3.5-mini-instruct-qnn-npu:2
                               CPU        chat           2.53 GB      MIT          Phi-3.5-mini-instruct-generic-cpu:2
--------------------------------------------------------------------------------------------------------------------------
phi-4                          CPU        chat           10.16 GB     MIT          Phi-4-generic-cpu:2
----------------------------------------------------------------------------------------------------------
deepseek-r1-7b                 NPU        chat           3.71 GB      MIT          deepseek-r1-distill-qwen-7b-qnn-npu:2
                               CPU        chat           6.43 GB      MIT          deepseek-r1-distill-qwen-7b-generic-cpu:4
--------------------------------------------------------------------------------------------------------------------------------
phi-3-mini-4k                  NPU        chat           2.78 GB      MIT          phi-3-mini-4k-instruct-qnn-npu:3
                               CPU        chat           2.53 GB      MIT          Phi-3-mini-4k-instruct-generic-cpu:3
---------------------------------------------------------------------------------------------------------------------------
mistral-7b-v0.2                CPU        chat           4.07 GB      apache-2.0   mistralai-Mistral-7B-Instruct-v0-2-generic-cpu:3
---------------------------------------------------------------------------------------------------------------------------------------
deepseek-r1-14b                NPU        chat           7.12 GB      MIT          deepseek-r1-distill-qwen-14b-qnn-npu:2
                               CPU        chat           11.51 GB     MIT          deepseek-r1-distill-qwen-14b-generic-cpu:4
---------------------------------------------------------------------------------------------------------------------------------
qwen3-0.6b                     CPU        chat, tools    0.58 GB      apache-2.0   qwen3-0.6b-generic-cpu:4
PS C:\windows\system32>

現状これだけのモデルが使えてDeviceがNPUのものであればNPUを活用してLocalLLMが使えるようになる。 45TOPS程度なのでGPUゴリゴリのPCには当然負けるし、NPUが使えるモデルがまだまだ少ない現実なのでどこまで活用できる？と言われると微妙だなぁと言わざる得ない。

		2026/05
日	月	火	水	木	金	土
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30
31

KeruruDiary

メッセージ

2026/05/03(日)2026-05-03

foundryでLocalLLMする

素直にCopilot+PCとして使う？

MTerm

おめざめ

散髪