Re: AI/ML Model and Pre-Trained Weight Packaging in Fedora

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 04/03/2024 18.44, Ben Beasley wrote:
> If you want to apply additional review to neural net coefficients, I
> suppose you might as well start with those already packaged in
> stockfish[1]. (I CC’d the stockfish-maintainers email alias to loop in
> the primary maintainer. I am a co-maintainer, and I did the original
> package review.)
> 
> Stockfish is a state-of-the-art chess engine. The code is licensed
> GPL-3.0-or-later, but it requires two pre-trained neural network
> coefficient files to function. These coefficient files are selected from
> those at [2], all licensed CC0-1.0, and they are compiled into the
> binaries rather than shipped as separate files. This is quite consistent
> with treating them as content; there is a long history of including
> content – like graphics, audio, or text files – as data in compiled
> executables.
> 
> The Fedora package always uses the “default” coefficient sets for a
> particular release of Stockfish, as defined in [3].
> 
> – Ben Beasley (FAS: music)
> 
> [1] https://src.fedoraproject.org/rpms/stockfish
> 
> [2] https://tests.stockfishchess.org/nns
> 
> [3]
> https://github.com/official-stockfish/Stockfish/blob/e67cc979fd2c0e66dfc2b2f2daa0117458cfc462/src/evaluate.h#L42-L43
> 
> On 3/1/24 5:19 PM, Richard Fontana wrote:
>> Following Tim's explanations of various things, here are revised
>> answers to the questions:
>>
>> On Mon, Feb 26, 2024 at 6:32 PM Tim Flink<tflink@xxxxxxxxxxxxxxxxx> 
>> wrote:
>>
>>> Questions
>>> =========
>>>
>>> 1. Are pre-trained weights considered to be normal non-code
>>> content/data or do they require special handling?
>> For Fedora license classification purposes, they should be considered
>> "content". However, I think for any specific pre-trained weights that
>> will actually be included in Fedora packages, for some initial period
>> I'd like to do some further review (as noted upthread, because this is
>> an important policy area and we don't have a lot of prior experience
>> in it). I don't really care how that's done, that could be through
>> this list or a Bugzilla or whatever.
>>
>> We'll add "pre-trained weights" to the list of examples of what
>> "content" is in the Fedora legal docs.
>>
>>> 2. If an upstream offers pre-trained weights and indicates that those
>>> weights are available under a license which is acceptable for
>>> non-code content in Fedora, can those pre-trained weights be included
>>> in Fedora packages?
>> Yes subject to my answer to 1.
For data driven models such as pre-trained weights, some knowledge about
the data used for the training is required.  It would be good to have
some consideration of this, for example is a neural network code
generator which uses GPL code as training data also under GPL?
>>
>>> 3. Extending question 2, is it considered sufficient for an upstream
>>> to have a license on pre-trained weights or would a packager/reviewer
>>> need to verify that the data used to train those weights is acceptable?
>> A packager/reviewer should not need to do that verification, which
>> seems highly impractical (which is a point I think you may have
>> previously made). However, that could be an aspect of the "initial
>> legal review" I'm suggesting we may want to have for such cases.
>>
>>> 4. Is it acceptable to package code which downloads pre-trained
>>> weights from a non-Fedora source upon first use post-installation by
>>> a user if that model and its associated weights are
>>>      a. For a specific model?
>>>      b. For a user-defined model which may or may not exist at the
>>> time of packaging?
>> Given your explanations of these cases, I think this is pretty
>> straightforward.
>> 4a: Yes
>> 4b: Yes
>>
>> These answers only go to matters of Fedora legal/licensing policy. If
>> there are technical issues raised by these questions (for example, if
>> there ought to be some standards around packaging of upstream
>> pre-trained weights) I can't give guidance or informed opinions on
>> that beyond my initial suggestion to raise this topic with FESCo which
>> seems to have been unsuccessful.
>>
>> Richard
>> -- 
--
_______________________________________________
legal mailing list -- legal@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe send an email to legal-leave@xxxxxxxxxxxxxxxxxxxxxxx
Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: https://lists.fedoraproject.org/archives/list/legal@xxxxxxxxxxxxxxxxxxxxxxx
Do not reply to spam, report it: https://pagure.io/fedora-infrastructure/new_issue




[Index of Archives]     [Fedora Users]     [Fedora Desktop]     [Fedora SELinux]     [Big List of Linux Books]     [Yosemite News]     [Gnome Users]     [KDE Users]

  Powered by Linux