Hostname: page-component-745bb68f8f-hvd4g Total loading time: 0 Render date: 2025-01-27T10:06:29.080Z Has data issue: false hasContentIssue false

DATA-PUSHED PROJECTS: THE ROLE OF ANOMALIES TO BUILD DESIGN PROCESSES FOR SUBSEQUENT EXPLORATION

Published online by Cambridge University Press:  19 June 2023

Antoine Bordas*
Affiliation:
Mines Paris, PSL University, Centre for management science (CGS), i3 UMR CNRS, 75006 Paris, France
Pascal Le Masson
Affiliation:
Mines Paris, PSL University, Centre for management science (CGS), i3 UMR CNRS, 75006 Paris, France
Benoit Weil
Affiliation:
Mines Paris, PSL University, Centre for management science (CGS), i3 UMR CNRS, 75006 Paris, France
*
Bordas, Antoine, Mines Paris, France, antoine.bordas@minesparis.psl.eu

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

Data-pushed projects are common in companies and consist in the design of a model in order to deliver a desirable output. The design of data science models appears at the intersection of optimisation and creativity logic, with in both cases the presence of anomalies to a various extent but no clear design process.

This paper therefore proposes to study the possible design processes in data-pushed projects, highlighting distinct knowledge exploration logics and the role of anomalies in each. This research introduces a theoretical framework to study data-pushed projects and is based on design theory. Three case studies complete this theoretical work to examine each of the processes and test our hypothesis.

As a result, this paper derives three design processes adapted to data-pushed projects and put forward for each of them: 1) the various knowledge leveraged and generated and 2) the specific role of anomalies.

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
The Author(s), 2023. Published by Cambridge University Press

References

Adadi, A., Berrada, M., 2018. Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI). IEEE Access 6, 5213852160. https://doi.org/10.1109/ACCESS.2018.2870052CrossRefGoogle Scholar
Alsabti, K., Ranka, S., Singh, V., 1997. An efficient k-means clustering algorithm.Google Scholar
Barbier, R., Le Masson, P., Weil, B., 2021. Transforming data into added-value information: the design of scientific measurement models through the lens of design theory. Proc. Des. Soc. 1, 32393248. https://doi.org/10.1017/pds.2021.585CrossRefGoogle Scholar
Barnett, V., Lewis, T., 1984. Outliers in statistical data. Wiley Series in Probability and Mathematical Statistics. Applied Probability and Statistics.Google Scholar
Bloor, D., 1978. Polyhedra and the Abominations of Leviticus. The British Journal for the History of Science 11, 245272. https://doi.org/10.1017/S000708740004379XCrossRefGoogle Scholar
Braha, D., Reich, Y., 2003. Topological structures for modeling engineering design processes. Res Eng Design 14, 185199. https://doi.org/10.1007/s00163-003-0035-3CrossRefGoogle Scholar
Cao, L., 2018. Data Science: A Comprehensive Overview. ACM Comput. Surv. 50, 142.CrossRefGoogle Scholar
Cascini, G., Nagai, Y., Georgiev, G.V., Zelaya, J., Becattini, N., Boujut, J.F., Casakin, H., Crilly, N., Dekoninck, E., Gero, J., Goel, A., Goldschmidt, G., Gonçalves, M., Grace, K., Hay, L., Le Masson, P., Maher, M.L., Marjanović, D., Motte, D., Papalambros, P., Sosa, R., S, V., Štorga, M., Tversky, B., Yannou, B., Wodehouse, A., 2022. Perspectives on design creativity and innovation research: 10 years later. International Journal of Design Creativity and Innovation 10, 130. https://doi.org/10.1080/21650349.2022.2021480CrossRefGoogle Scholar
Corral, K., Schuff, D., Schymik, G., Louis, R.S., 2015. Enabling Self-Service BI Through a Dimensional Model Management Warehouse. 2015 Americas Conference on Information Systems, AMCIS 2015.Google Scholar
Davis, N.M., 2013. Human-Computer Co-Creativity: Blending Human and Computational Creativity, in: Ninth Artificial Intelligence and Interactive Digital Entertainment Conference. Presented at the Ninth Artificial Intelligence and Interactive Digital Entertainment Conference.Google Scholar
Daw, A., Karpatne, A., Watkins, W., Read, J., Kumar, V., 2021. Physics-guided Neural Networks (PGNN): An Application in Lake Temperature Modeling. https://doi.org/10.48550/arXiv.1710.11431CrossRefGoogle Scholar
Einstein, A., 1987. Letters to Solovine. Philosophical Library.Google Scholar
European Parliament, 2016. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation), 2016.Google Scholar
Fahrmeir, L., Kneib, T., Lang, S., Marx, B.D., 2021. Regression Models, in: Fahrmeir, L., Kneib, T., Lang, S., Marx, B.D. (Eds.), Regression: Models, Methods and Applications. Springer, Berlin, Heidelberg, pp. 2384. https://doi.org/10.1007/978-3-662-63882-8_2CrossRefGoogle Scholar
Friedman, J., Hastie, T., Tibshirani, R., 2001. The elements of statistical learning. Springer series in statistics New York.Google Scholar
Hatchuel, A., Reich, Y., Le Masson, P., Weil, B., Kazakçi, A., 2013. Beyond Models and Decisions: Situating Design through generative functions, in: International Conference on Engineering Design. Séoul, South Korea.Google Scholar
Hatchuel, A., Weil, B., 2008. C-K design theory: an advanced formulation. Res Eng Design 19, 181.CrossRefGoogle Scholar
Hatchuel, A., Weil, B., 2003. A new approach of innovative design: an introduction to C-K theory. DS 31: Proceedings of ICED 03, the 14th International Conference on Engineering Design, Stockholm 109-110 (exec.summ.), full paper no. DS31_1794FPC.Google Scholar
Hodge, V., Austin, J., 2004. A Survey of Outlier Detection Methodologies. Artificial Intelligence Review 22, 85126. https://doi.org/10.1023/B:AIRE.0000045502.10941.a9CrossRefGoogle Scholar
Holton, 1981. L'imagination scientifique. Gallimard.Google Scholar
Howard, T., Culley, S.J., Dekoninck, E., 2007. Creativity in the Engineering Design Process. DS 42: Proceedings of ICED 2007, the 16th International Conference on Engineering Design, Paris, France, 28.-31.07.2007 329-330 (exec. Summ.), full paper no. DS42_P_493.Google Scholar
Howard, T.J., Dekoninck, E.A., Culley, S.J., 2010. The use of creative stimuli at early stages of industrial product innovation. Res Eng Design 21. https://doi.org/10.1007/s00163-010-0091-4CrossRefGoogle Scholar
James, G., Witten, D., Hastie, T., Tibshinari, R, 2013. An Introduction to Statistical Learning, New York: Springer. ed.CrossRefGoogle Scholar
Karpatne, A., Atluri, G., Faghmous, J.H., Steinbach, M., Banerjee, A., Ganguly, A., Shekhar, S., Samatova, N., Kumar, V., 2017. Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data. IEEE Transactions on Knowledge and Data Engineering 29, 23182331. https://doi.org/10.1109/TKDE.2017.2720168CrossRefGoogle Scholar
Kazakçı, A.O., 2015. Data science as a new frontier for design. Presented at the International Conference on Engineering Design.Google Scholar
Kuhn, T., 2021. The Structure of Scientific Revolutions, in: Philosophy after Darwin: Classic and Contemporary Readings. Princeton University Press, pp. 176177. https://doi.org/10.1515/9781400831296-024CrossRefGoogle Scholar
Lu, S., Li, Z., Qin, Z., Yang, X., Goh, R.S.M., 2017. A hybrid regression technique for house prices prediction, in: 2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM). Presented at the 2017 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), pp. 319323. https://doi.org/10.1109/IEEM.2017.8289904CrossRefGoogle Scholar
Oakland, J., 2007. Statistical Process Control, 6th ed. Routledge, London. https://doi.org/10.4324/9780080551739CrossRefGoogle Scholar
Pahl, G., Beitz, W., Feldhusen, J., Grote, K.-H., 2007. Engineering Design. Springer, London. https://doi.org/10.1007/978-1-84628-319-2CrossRefGoogle Scholar
Redelinghuys, C., Bahill, A.T., 2006. A framework for the assessment of the creativity of product design teams. Journal of Engineering Design 17, 121141. https://doi.org/10.1080/09544820500273136CrossRefGoogle Scholar
Ruder, S., 2017. An overview of gradient descent optimization algorithms. https://doi.org/10.48550/arXiv.1609.04747CrossRefGoogle Scholar
Runco, M.A., 1994. Problem Finding, Problem Solving, and Creativity. Greenwood Publishing Group.Google Scholar
Skiena, S.S., 2017. The Data Science Design Manual, Texts in Computer Science. Springer International Publishing, Cham. https://doi.org/10.1007/978-3-319-55444-0CrossRefGoogle Scholar
Trabucchi, D., Buganza, T., 2018. Data-driven innovation: switching the perspective on Big Data. European Journal of Innovation Management 22, 2340. https://doi.org/10.1108/EJIM-01-2018-0017CrossRefGoogle Scholar
von Rueden, L., Mayer, S., Beckh, K., Georgiev, B., Giesselbach, S., Heese, R., Kirsch, B., Pfrommer, J., Pick, A., Ramamurthy, R., Walczak, M., Garcke, J., Bauckhage, C., Schuecker, J., 2021. Informed Machine Learning -- A Taxonomy and Survey of Integrating Knowledge into Learning Systems. IEEE Trans. Knowl. Data Eng. 11. https://doi.org/10.1109/TKDE.2021.3079836CrossRefGoogle Scholar