Hey, I'm a student developer who's interested in getting into open source development.I came across Ludwig while browsing for projects and found it interesting and would like to contribute to it in some way. I do have two years of industry experience as a software developer but I'm new to open source development. Are there any beginner-friendly features or modules that I can work on ? I'd prefer to start on some easy tasks/bug-fixes that would roughly take about a couple of days or a week to complete, before moving on to bigger features. Thanks a lot for your help in advance!
@desikan95 thank you really much for your interest and your willingness to get involved!
I believe the best way to get started without the need to get all the nitty gritty details of Ludwig internals is helping out with tests. It's a great way to gradually get acquainted with and discose complexity gradually.
At the same time, there are also some tasks and features that are not tests that we would be happy to receive help with, I suggest you check the todo column of the Ludwig project here: https://github.com/uber/ludwig/projects/1
If something pops that's particularly interesting to you we could go that route instead.
Let me know what you think and we can get you invovled :)
@w4nderlust I'm happy to help out! Thank you for your quick response. I went through some of the issues and I thought I could work on supporting more data sources for Ludwig (https://github.com/uber/ludwig/issues/847).
I'm happy to help write tests for any existing features as well, if you feel that will help me get acquainted with the project quicker. Pls feel free to assign either of these tasks to me.
@desikan95 thank you really much for your availability! Really appreciated.
Regarding data soruces, in the recent v0.3 version of Ludwig we added many new supported formats already, and are still working on adding even more and improve the preprocessing capabilities to make everything data related in Lduwig more efficient and scalable. In this regard, we have a PR we actively working on for v0.4: https://github.com/uber/ludwig/pull/970 . I wouldn't suggest you to get started helping with that though, because the PR is already in a pretty advanced state, and impacts quaite a lot of the Ludwig codebase.
What i would suggest is rather: once that PR is landed, you could help improving it further by adding more data sources support (like hooking to SQL sources would be great).
In the mean time though, while we complete that PR, if you want to already get started, working on some tests seems ideal to me and will be a good way of approaching Ludwig :)
If you send me your email privately i can invite you to the community chat, where we have a channel specifically devoted to testing.