null | Eddie’s Blog

Date

Dec 2, 2023 10:00

Type

only Students

Contributors

Developing a Multi-Modal True Color Detection AI Model Tailored for Diverse Industries, with a Emphasis on Fashion Industry

Crafting an Exquisite Multi-Modal True Color Detection AI Model Tailored for Diverse Industries, with a Distinct Emphasis on the Opulent World of Fashion

Developing a Multi-Modal True Color Detection AI Model Tailored for Diverse Industries, with a Special Emphasis on the Fashion Sector

제안 배경

현대 사회에서 개인의 의모와 스타일은 중요한 관심사 중 하나로 여겨지고 있다. Personal color는 이러한 맥락에서 개인의 특성을 고려하여 어울리는 색상을 찾아낼 수 있다는 점에서 중요한 개념이다. 그러나 현재의 personal color market은 여성의 cosmetic market에만 집중되고 있기 때문에 clothing market에서는 비전문가인 개인이 personal color 개념을 적용하기가 어려운 상황이다. 이러한 문제를 해결하기 위한 하나의 방법으로, on/offline에서 사용자의 personal color data를 입력하면 그에 맞는 의류를 추천해주거나, 사용자의 구매 희망 의류에 대해 personal color 적합도를 평가하는 AI model을 제시하고자 한다. 이러한 model의 이름을 Personalized Color Matching Apparel AI Model이라 지칭하겠다. 본 model을 build하기 위해서는 먼저 해결되어야 하는 7개의 sub-problems가 있다. 그중 다른 산업에서도 적용이 가능하며, 최종 AI Model에 가장 critical한 영향을 미치는 sub-problem을 본 연구의 목표로 설정하였다.

과제 목표

텍스트와 이미지를 기반으로 옷의 원본 색을 추출하는 Multi Modal True Color Detection AI Model을 build함으로써, 기존 의류 산업의 한계점을 극복하고, 나아가 True Color가 중요한 기타 산업에도 새로운 approach를 제공한다.

과제 내용

NLP : 자연어 처리를 통한 피드백 제공

True Color Detection : 다른 조건의 이미지에서 동일한 색상의 정확한 RGB 값 추출

Multi Modal Ai - 의류 사용자 리뷰 (텍스트) 기반 데이터 + 의류 사진 데이터 기반 모델 학습

기대 효과

고객 입장에서의 비용 절감

무신사와 같은 온라인 패션몰이 dominant한 영향을 끼치고 있는 상황에서, 온라인에서 옷을 구매하는 소비자들이 많아지고 있다. 하지만 여전히 실제로 옷을 받아보기 전까지는 정확한 color를 알아낼 수 없기에, 교환이나 반품이 자주 일어나고 있다. 정확한 color information을 제공할 수 있다면 이러한 문제를 해결할 수 있다.

브랜드 입장에서의 비용 절감

원본 색상이 잘 반영된 제품 사진을 얻기 위해선 다양한 각도에서의 사진 촬영, 조명 조절. 편집 소프트웨어 사용 등 많은 어려움이 있다. 해당 model을 사용함으로서 이러한 준비 과정을 줄여 비용을 절감할 수 있다.

산업군 확장성

True Color Detection AI Model이 적절한 수준으로 개발이 된다면, 의류 산업 뿐만 아니라 농업, 임업 등 다양한 산업군의 문제 상황에도 솔루션으로 적용 가능할 것으로 기대된다.

참고자료

[1] Srinivasan, K., Raman, K., Chen, J., Bendersky, M., & Najork, M. (2021, July). Wit: Wikipedia-based image text dataset for multimodal multilingual machine learning. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2443-2449). https://dl.acm.org/doi/pdf/10.1145/3404835.3463257

[2] Zhang, Y., Jiang, H., Miura, Y., Manning, C. D., & Langlotz, C. P. (2022, December). Contrastive learning of medical visual representations from paired images and text. In Machine Learning for Healthcare Conference (pp. 2-25). PMLR. https://proceedings.mlr.press/v182/zhang22a/zhang22a.pdf

[3] Singh, A., & Tucker, C. S. (2017). A machine learning approach to product review disambiguation based on function, form and behavior classification. Decision Support Systems, 97, 81-91. https://www.sciencedirect.com/science/article/pii/S0167923617300477?casa_token=wlEq6_tjkRYAAAAA:vc2BSsu2HUj3TfiZAkQAoPm9BjFIilN--DBZ6JlYtSG44sWnl2uTYKDR3-UGOIcjG18Pg4tIi0rl

[4] Li, M. (2021). Research on extraction of useful tourism online reviews based on multimodal feature fusion. Transactions on Asian and Low-Resource Language Information Processing, 20(5), 1-16. https://dl.acm.org/doi/abs/10.1145/3453694?casa_token=hHpgtZwYPrwAAAAA:FGk-6MpKqgBV0yJX3x2X1H4DZfICOtS1sJTdll3wwd7kWYhgvcCoghVSFw2iNMIrS-fx2jBpa9aQDOs