See More
Popular Forum

MBA (4887) B.Tech (1769) Engineering (1486) Class 12 (1030) Study Abroad (1004) Computer Science and Engineering (988) Business Management Studies (865) BBA (846) Diploma (746) CAT (651) B.Com (648) B.Sc (643) JEE Mains (618) Mechanical Engineering (574) Exam (525) India (462) Career (452) All Time Q&A (439) Mass Communication (427) BCA (417) Science (384) Computers & IT (Non-Engg) (383) Medicine & Health Sciences (381) Hotel Management (373) Civil Engineering (353) MCA (349) Tuteehub Top Questions (348) Distance (340) Colleges in India (334)
See More
( 7 months ago )

What are best practices for collaborative feature engineering?

General Tech Learning Aids/Tools
Max. 2000 characters

Sarah Jones


( 7 months ago )

I work in a large company on several data science projects. For each of the projects me and my colleagues construct features that have some predictive value for the specific target in that project.

Some project are similar in that they a predict something for the same kind of entity, for example customers or goods.

It would make sense to me to share features between projects that are about the same entity. Or at the least, make it easy to reuse features from another project. For example, in some project someone could construct the feature "customer since" which would indicate the number of years someone is a customer. In some other project some constructed a feature "estimated age" that is the outcome of some machine learning pipeline. In a third project I might want to use both features.

What are best practices in sharing these features? Should I share code or materialized outcomes? Are there packages that aid this process? How does your company solve this?

what's your interest