{"id":4645,"date":"2018-10-10T11:40:29","date_gmt":"2018-10-10T09:40:29","guid":{"rendered":"https:\/\/www.kde.cs.uni-kassel.de\/?page_id=4645"},"modified":"2023-11-27T13:27:30","modified_gmt":"2023-11-27T12:27:30","slug":"stubbemann","status":"publish","type":"page","link":"https:\/\/www.kde.cs.uni-kassel.de\/en\/stubbemann","title":{"rendered":"Maximilian Stubbemann"},"content":{"rendered":"<div id=\"trailimageid\"><img decoding=\"async\" id=\"ttimg\" src=\"https:\/\/www.kde.cs.uni-kassel.de\/wp-content\/plugins\/bibsonomy-csl\/img\/loading.gif\"><\/div> <div class=\"wp-block-image\">\n<figure class=\"alignright size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"474\" height=\"608\" src=\"https:\/\/www.kde.cs.uni-kassel.de\/wp-content\/uploads\/2020\/03\/maxst.jpeg\" alt=\"\" class=\"wp-image-6987\" style=\"width:224px;height:288px\" srcset=\"https:\/\/www.kde.cs.uni-kassel.de\/wp-content\/uploads\/2020\/03\/maxst.jpeg 474w, https:\/\/www.kde.cs.uni-kassel.de\/wp-content\/uploads\/2020\/03\/maxst-234x300.jpeg 234w\" sizes=\"auto, (max-width: 474px) 100vw, 474px\" \/><\/figure>\n<\/div>\n\n\n<p>I have left the Knowledge and Data Engineering Group on 31.10.2023. I am now working at the Information Systems and Machine Learning Lab of the University of Hildesheim. My new homepage is available <a href=\"https:\/\/www.ismll.uni-hildesheim.de\/personen\/stubbemann_en.html\">here<\/a>.<br>Email: <a href=\"mailto:stubbemann@cs.uni-kassel.de\"><u><span style=\"color: #0066cc;\">stubbemann@cs.uni-kassel.de<\/span><\/u><\/a><\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><\/h2>\n\n\n\n<h2 class=\"wp-block-heading\">Publications<\/h2>\n\n\n\n<p><ul class=\"bibsonomycsl_publications\"><li class=\"bibsonomycsl_pubitem\"><div class=\"bibsonomycsl_entry\"><div class=\"csl-bib-body\">\n  <div class=\"csl-entry\"><div class=\"csl-left-margin\"><span style=\"display: none;\">1.<\/span><\/div><div class=\"csl-right-inline\"><span class=\"csl-author\">Stubbemann, M., Hanika, T., Schneider, F.M.: <\/span><span class=\"csl-title\"><span class=\"csl-title\">Intrinsic Dimension for Large-Scale Geometric Learning<\/span>.<\/span> Transactions on Machine Learning Research. (2023).<\/div><\/div>\n<\/div><span class=\"bibsonomycsl_url\"><a href=\"https:\/\/openreview.net\/forum?id=85BfDdYMBY\" target=\"_blank\">URL<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_bibtex\"><a rel=\"bib-292ea436563609314995751c9edb3c43\" href=\"#\">BibTeX<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_endnote\"><a rel=\"end-292ea436563609314995751c9edb3c43\" href=\"#\">EndNote<\/a><\/span><div style=\"clear: left\"> <\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_bibtex\" style=\"display:none;\" id=\"bib-292ea436563609314995751c9edb3c43\"><p>@article{stubbemann2022intrinsic,<br\/>  author = {Stubbemann, Maximilian and Hanika, Tom and Schneider, Friedrich Martin},<br\/>  journal = {Transactions on Machine Learning Research},<br\/>  keywords = {itegpub},<br\/>  title = {Intrinsic Dimension for Large-Scale Geometric Learning},<br\/>  year = 2023<br\/>}<br\/><\/p><\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_endnote\" style=\"display:none;\" id=\"end-292ea436563609314995751c9edb3c43\"><p>%0 Journal Article<br\/>%1 stubbemann2022intrinsic<br\/>%A Stubbemann, Maximilian<br\/>%A Hanika, Tom<br\/>%A Schneider, Friedrich Martin<br\/>%D 2023<br\/>%J Transactions on Machine Learning Research<br\/>%T Intrinsic Dimension for Large-Scale Geometric Learning<br\/>%U https:\/\/openreview.net\/forum?id=85BfDdYMBY<br\/><\/p><\/div><\/div><\/li><li class=\"bibsonomycsl_pubitem\"><div class=\"bibsonomycsl_entry\"><div class=\"csl-bib-body\">\n  <div class=\"csl-entry\"><div class=\"csl-left-margin\"><span style=\"display: none;\">1.<\/span><\/div><div class=\"csl-right-inline\"><span class=\"csl-author\">Stubbemann, M., Stumme, G.: <\/span><span class=\"csl-title\"><span class=\"csl-title\">The Mont Blanc of Twitter: Identifying Hierarchies of Outstanding Peaks in Social Networks<\/span>.<\/span> In: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2023. pp. 177\u2013192. Springer (2023). https:\/\/doi.org\/10.1007\/978-3-031-43418-1\\_11.<\/div><\/div>\n<\/div><span class=\"bibsonomycsl_url\"><a href=\"https:\/\/doi.org\/10.1007\/978-3-031-43418-1\\_11\" target=\"_blank\">URL<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_bibtex\"><a rel=\"bib-004218ef1c7594c76cc7b433e950bef0\" href=\"#\">BibTeX<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_endnote\"><a rel=\"end-004218ef1c7594c76cc7b433e950bef0\" href=\"#\">EndNote<\/a><\/span><div style=\"clear: left\"> <\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_bibtex\" style=\"display:none;\" id=\"bib-004218ef1c7594c76cc7b433e950bef0\"><p>@inproceedings{DBLP:conf\/pkdd\/StubbemannS23,<br\/>  author = {Stubbemann, Maximilian and Stumme, Gerd},<br\/>  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2023},<br\/>  keywords = {itegpub},<br\/>  pages = {177--192},<br\/>  publisher = {Springer},<br\/>  series = {Lecture Notes in Computer Science},<br\/>  title = {The Mont Blanc of Twitter: Identifying Hierarchies of Outstanding Peaks in Social Networks},<br\/>  volume = 14171,<br\/>  year = 2023<br\/>}<br\/><\/p><\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_endnote\" style=\"display:none;\" id=\"end-004218ef1c7594c76cc7b433e950bef0\"><p>%0 Conference Paper<br\/>%1 DBLP:conf\/pkdd\/StubbemannS23<br\/>%A Stubbemann, Maximilian<br\/>%A Stumme, Gerd<br\/>%B European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2023<br\/>%D 2023<br\/>%I Springer<br\/>%P 177--192<br\/>%R 10.1007\/978-3-031-43418-1\\_11<br\/>%T The Mont Blanc of Twitter: Identifying Hierarchies of Outstanding Peaks in Social Networks<br\/>%U https:\/\/doi.org\/10.1007\/978-3-031-43418-1\\_11<br\/>%V 14171<br\/><\/p><\/div><\/div><\/li><li class=\"bibsonomycsl_pubitem\"><div class=\"bibsonomycsl_entry\"><div class=\"csl-bib-body\">\n  <div class=\"csl-entry\"><div class=\"csl-left-margin\"><span style=\"display: none;\">1.<\/span><\/div><div class=\"csl-right-inline\"><span class=\"csl-author\">Stubbemann, M., Hille, T., Hanika, T.: <\/span><span class=\"csl-title\"><span class=\"csl-title\">Selecting Features by their Resilience to the Curse of Dimensionality<\/span>.<\/span> (2023).<\/div><\/div>\n<\/div><span class=\"bibsonomycsl_export bibsonomycsl_bibtex\"><a rel=\"bib-226d9608a2279971502570f1c18c5fa4\" href=\"#\">BibTeX<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_endnote\"><a rel=\"end-226d9608a2279971502570f1c18c5fa4\" href=\"#\">EndNote<\/a><\/span><div style=\"clear: left\"> <\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_bibtex\" style=\"display:none;\" id=\"bib-226d9608a2279971502570f1c18c5fa4\"><p>@article{stubbemann2023selecting,<br\/>  author = {Stubbemann, Maximilian and Hille, Tobias and Hanika, Tom},<br\/>  keywords = {itegpub},<br\/>  title = {Selecting Features by their Resilience to the Curse of Dimensionality},<br\/>  year = 2023<br\/>}<br\/><\/p><\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_endnote\" style=\"display:none;\" id=\"end-226d9608a2279971502570f1c18c5fa4\"><p>%0 Journal Article<br\/>%1 stubbemann2023selecting<br\/>%A Stubbemann, Maximilian<br\/>%A Hille, Tobias<br\/>%A Hanika, Tom<br\/>%D 2023<br\/>%T Selecting Features by their Resilience to the Curse of Dimensionality<br\/><\/p><\/div><\/div><\/li><li class=\"bibsonomycsl_pubitem\"><div class=\"bibsonomycsl_entry\"><div class=\"csl-bib-body\">\n  <div class=\"csl-entry\"><div class=\"csl-left-margin\"><span style=\"display: none;\">1.<\/span><\/div><div class=\"csl-right-inline\"><span class=\"csl-author\">Stubbemann, M., Stumme, G.: <\/span><span class=\"csl-title\"><span class=\"csl-title\">LG4AV: Combining Language Models and Graph Neural Networks for Author Verification<\/span>.<\/span> In: Bouadi, T., Fromont, E., and H{\u00fc}llermeier, E. (eds.) Advances in Intelligent Data Analysis XX. pp. 315\u2013326. Springer International Publishing, Cham (2022).<\/div><\/div>\n<\/div><span class=\"bibsonomycsl_url\"><a href=\"https:\/\/link.springer.com\/chapter\/10.1007\/978-3-031-01333-1_25\" target=\"_blank\">URL<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_bibtex\"><a rel=\"bib-eb5cdf718b65cf848d9e9fc6d508c614\" href=\"#\">BibTeX<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_endnote\"><a rel=\"end-eb5cdf718b65cf848d9e9fc6d508c614\" href=\"#\">EndNote<\/a><\/span><div style=\"clear: left\"> <\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_abstract\" style=\"display:none;\" id=\"abs-eb5cdf718b65cf848d9e9fc6d508c614\">The verification of document authorships is important in various settings. Researchers are for example judged and compared by the amount and impact of their publications and public figures are confronted by their posts on social media. Therefore, it is important that authorship information in frequently used data sets is correct. The question whether a given document is written by a given author is commonly referred to as authorship verification (AV). While AV is a widely investigated problem in general, only few works consider settings where the documents are short and written in a rather uniform style. This makes most approaches impractical for bibliometric data. Here, authorships of scientific publications have to be verified, often with just abstracts and titles available. To this point, we present LG4AV which combines language models and graph neural networks for authorship verification. By directly feeding the available texts in a pre-trained transformer architecture, our model does not need any hand-crafted stylometric features that are not meaningful in scenarios where the writing style is, at least to some extent, standardized. By the incorporation of a graph neural network structure, our model can benefit from relations between authors that are meaningful with respect to the verification process.<\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_bibtex\" style=\"display:none;\" id=\"bib-eb5cdf718b65cf848d9e9fc6d508c614\"><p>@inproceedings{10.1007\/978-3-031-01333-1_25,<br\/>  abstract = {The verification of document authorships is important in various settings. Researchers are for example judged and compared by the amount and impact of their publications and public figures are confronted by their posts on social media. Therefore, it is important that authorship information in frequently used data sets is correct. The question whether a given document is written by a given author is commonly referred to as authorship verification (AV). While AV is a widely investigated problem in general, only few works consider settings where the documents are short and written in a rather uniform style. This makes most approaches impractical for bibliometric data. Here, authorships of scientific publications have to be verified, often with just abstracts and titles available. To this point, we present LG4AV which combines language models and graph neural networks for authorship verification. By directly feeding the available texts in a pre-trained transformer architecture, our model does not need any hand-crafted stylometric features that are not meaningful in scenarios where the writing style is, at least to some extent, standardized. By the incorporation of a graph neural network structure, our model can benefit from relations between authors that are meaningful with respect to the verification process.},<br\/>  address = {Cham},<br\/>  author = {Stubbemann, Maximilian and Stumme, Gerd},<br\/>  booktitle = {Advances in Intelligent Data Analysis XX},<br\/>  editor = {Bouadi, Tassadit and Fromont, Elisa and H{\u00fc}llermeier, Eyke},<br\/>  keywords = {itegpub},<br\/>  pages = {315--326},<br\/>  publisher = {Springer International Publishing},<br\/>  title = {LG4AV: Combining Language Models and Graph Neural Networks for Author Verification},<br\/>  year = 2022<br\/>}<br\/><\/p><\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_endnote\" style=\"display:none;\" id=\"end-eb5cdf718b65cf848d9e9fc6d508c614\"><p>%0 Conference Paper<br\/>%1 10.1007\/978-3-031-01333-1_25<br\/>%A Stubbemann, Maximilian<br\/>%A Stumme, Gerd<br\/>%B Advances in Intelligent Data Analysis XX<br\/>%C Cham<br\/>%D 2022<br\/>%E Bouadi, Tassadit<br\/>%E Fromont, Elisa<br\/>%E H{\u00fc}llermeier, Eyke<br\/>%I Springer International Publishing<br\/>%P 315--326<br\/>%T LG4AV: Combining Language Models and Graph Neural Networks for Author Verification<br\/>%U https:\/\/link.springer.com\/chapter\/10.1007\/978-3-031-01333-1_25<br\/>%X The verification of document authorships is important in various settings. Researchers are for example judged and compared by the amount and impact of their publications and public figures are confronted by their posts on social media. Therefore, it is important that authorship information in frequently used data sets is correct. The question whether a given document is written by a given author is commonly referred to as authorship verification (AV). While AV is a widely investigated problem in general, only few works consider settings where the documents are short and written in a rather uniform style. This makes most approaches impractical for bibliometric data. Here, authorships of scientific publications have to be verified, often with just abstracts and titles available. To this point, we present LG4AV which combines language models and graph neural networks for authorship verification. By directly feeding the available texts in a pre-trained transformer architecture, our model does not need any hand-crafted stylometric features that are not meaningful in scenarios where the writing style is, at least to some extent, standardized. By the incorporation of a graph neural network structure, our model can benefit from relations between authors that are meaningful with respect to the verification process.<br\/>%@ 978-3-031-01333-1<br\/><\/p><\/div><\/div><\/li><li class=\"bibsonomycsl_pubitem\"><div class=\"bibsonomycsl_entry\"><div class=\"csl-bib-body\">\n  <div class=\"csl-entry\"><div class=\"csl-left-margin\"><span style=\"display: none;\">1.<\/span><\/div><div class=\"csl-right-inline\"><span class=\"csl-author\">Koopmann, T., Stubbemann, M., Kapa, M., Paris, M., Buenstorf, G., Hanika, T., Hotho, A., J\u00e4schke, R., Stumme, G.: <\/span><span class=\"csl-title\"><span class=\"csl-title\">Proximity dimensions and the emergence of collaboration: a HypTrails study on German AI research<\/span>.<\/span> Scientometrics. (2021). https:\/\/doi.org\/10.1007\/s11192-021-03922-1.<\/div><\/div>\n<\/div><span class=\"bibsonomycsl_url\"><a href=\"https:\/\/doi.org\/10.1007\/s11192-021-03922-1\" target=\"_blank\">URL<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_bibtex\"><a rel=\"bib-403544c0381c5a42e340c6f288bee105\" href=\"#\">BibTeX<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_endnote\"><a rel=\"end-403544c0381c5a42e340c6f288bee105\" href=\"#\">EndNote<\/a><\/span><div style=\"clear: left\"> <\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_abstract\" style=\"display:none;\" id=\"abs-403544c0381c5a42e340c6f288bee105\">Creation and exchange of knowledge depends on collaboration. Recent work has suggested that the emergence of collaboration frequently relies on geographic proximity. However, being co-located tends to be associated with other dimensions of proximity, such as social ties or a shared organizational environment. To account for such factors, multiple dimensions of proximity have been proposed, including cognitive, institutional, organizational, social and geographical proximity. Since they strongly interrelate, disentangling these dimensions and their respective impact on collaboration is challenging. To address this issue, we propose various methods for measuring different dimensions of proximity. We then present an approach to compare and rank them with respect to the extent to which they indicate co-publications and co-inventions. We adapt the HypTrails approach, which was originally developed to explain human navigation, to co-author and co-inventor graphs. We evaluate this approach on a subset of the German research community, specifically academic authors and inventors active in research on artificial intelligence (AI). We find that social proximity and cognitive proximity are more important for the emergence of collaboration than geographic proximity.<\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_bibtex\" style=\"display:none;\" id=\"bib-403544c0381c5a42e340c6f288bee105\"><p>@article{koopmann2021proximity,<br\/>  abstract = {Creation and exchange of knowledge depends on collaboration. Recent work has suggested that the emergence of collaboration frequently relies on geographic proximity. However, being co-located tends to be associated with other dimensions of proximity, such as social ties or a shared organizational environment. To account for such factors, multiple dimensions of proximity have been proposed, including cognitive, institutional, organizational, social and geographical proximity. Since they strongly interrelate, disentangling these dimensions and their respective impact on collaboration is challenging. To address this issue, we propose various methods for measuring different dimensions of proximity. We then present an approach to compare and rank them with respect to the extent to which they indicate co-publications and co-inventions. We adapt the HypTrails approach, which was originally developed to explain human navigation, to co-author and co-inventor graphs. We evaluate this approach on a subset of the German research community, specifically academic authors and inventors active in research on artificial intelligence (AI). We find that social proximity and cognitive proximity are more important for the emergence of collaboration than geographic proximity.},<br\/>  author = {Koopmann, Tobias and Stubbemann, Maximilian and Kapa, Matthias and Paris, Michael and Buenstorf, Guido and Hanika, Tom and Hotho, Andreas and J\u00e4schke, Robert and Stumme, Gerd},<br\/>  journal = {Scientometrics},<br\/>  keywords = {regio},<br\/>  title = {Proximity dimensions and the emergence of collaboration: a HypTrails study on German AI research},<br\/>  year = 2021<br\/>}<br\/><\/p><\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_endnote\" style=\"display:none;\" id=\"end-403544c0381c5a42e340c6f288bee105\"><p>%0 Journal Article<br\/>%1 koopmann2021proximity<br\/>%A Koopmann, Tobias<br\/>%A Stubbemann, Maximilian<br\/>%A Kapa, Matthias<br\/>%A Paris, Michael<br\/>%A Buenstorf, Guido<br\/>%A Hanika, Tom<br\/>%A Hotho, Andreas<br\/>%A J\u00e4schke, Robert<br\/>%A Stumme, Gerd<br\/>%D 2021<br\/>%J Scientometrics<br\/>%R 10.1007\/s11192-021-03922-1<br\/>%T Proximity dimensions and the emergence of collaboration: a HypTrails study on German AI research<br\/>%U https:\/\/doi.org\/10.1007\/s11192-021-03922-1<br\/>%X Creation and exchange of knowledge depends on collaboration. Recent work has suggested that the emergence of collaboration frequently relies on geographic proximity. However, being co-located tends to be associated with other dimensions of proximity, such as social ties or a shared organizational environment. To account for such factors, multiple dimensions of proximity have been proposed, including cognitive, institutional, organizational, social and geographical proximity. Since they strongly interrelate, disentangling these dimensions and their respective impact on collaboration is challenging. To address this issue, we propose various methods for measuring different dimensions of proximity. We then present an approach to compare and rank them with respect to the extent to which they indicate co-publications and co-inventions. We adapt the HypTrails approach, which was originally developed to explain human navigation, to co-author and co-inventor graphs. We evaluate this approach on a subset of the German research community, specifically academic authors and inventors active in research on artificial intelligence (AI). We find that social proximity and cognitive proximity are more important for the emergence of collaboration than geographic proximity.<br\/><\/p><\/div><\/div><\/li><li class=\"bibsonomycsl_pubitem\"><div class=\"bibsonomycsl_entry\"><div class=\"csl-bib-body\">\n  <div class=\"csl-entry\"><div class=\"csl-left-margin\"><span style=\"display: none;\">1.<\/span><\/div><div class=\"csl-right-inline\"><span class=\"csl-author\">Stubbemann, M., Hanika, T., Stumme, G.: <\/span><span class=\"csl-title\"><span class=\"csl-title\">Orometric Methods in Bounded Metric Data<\/span>.<\/span> In: Berthold, M.R., Feelders, A., and Krempl, G. (eds.) Advances in Intelligent Data Analysis {XVIII} - 18th International Symposium on Intelligent Data Analysis, {IDA} 2020, Konstanz, Germany, April 27-29, 2020, Proceedings. pp. 496\u2013508. Springer (2020). https:\/\/doi.org\/10.1007\/978-3-030-44584-3\\_39.<\/div><\/div>\n<\/div><span class=\"bibsonomycsl_url\"><a href=\"https:\/\/doi.org\/10.1007\/978-3-030-44584-3\\_39\" target=\"_blank\">URL<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_bibtex\"><a rel=\"bib-46473c28ad59dcf47f2dbe1427740370\" href=\"#\">BibTeX<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_endnote\"><a rel=\"end-46473c28ad59dcf47f2dbe1427740370\" href=\"#\">EndNote<\/a><\/span><div style=\"clear: left\"> <\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_bibtex\" style=\"display:none;\" id=\"bib-46473c28ad59dcf47f2dbe1427740370\"><p>@inproceedings{DBLP:conf\/ida\/StubbemannHS20,<br\/>  author = {Stubbemann, Maximilian and Hanika, Tom and Stumme, Gerd},<br\/>  booktitle = {Advances in Intelligent Data Analysis {XVIII} - 18th International Symposium on Intelligent Data Analysis, {IDA} 2020, Konstanz, Germany, April 27-29, 2020, Proceedings},<br\/>  editor = {Berthold, Michael R. and Feelders, Ad and Krempl, Georg},<br\/>  keywords = {itegpub},<br\/>  pages = {496--508},<br\/>  publisher = {Springer},<br\/>  series = {Lecture Notes in Computer Science},<br\/>  title = {Orometric Methods in Bounded Metric Data},<br\/>  volume = 12080,<br\/>  year = 2020<br\/>}<br\/><\/p><\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_endnote\" style=\"display:none;\" id=\"end-46473c28ad59dcf47f2dbe1427740370\"><p>%0 Conference Paper<br\/>%1 DBLP:conf\/ida\/StubbemannHS20<br\/>%A Stubbemann, Maximilian<br\/>%A Hanika, Tom<br\/>%A Stumme, Gerd<br\/>%B Advances in Intelligent Data Analysis {XVIII} - 18th International Symposium on Intelligent Data Analysis, {IDA} 2020, Konstanz, Germany, April 27-29, 2020, Proceedings<br\/>%D 2020<br\/>%E Berthold, Michael R.<br\/>%E Feelders, Ad<br\/>%E Krempl, Georg<br\/>%I Springer<br\/>%P 496--508<br\/>%R 10.1007\/978-3-030-44584-3\\_39<br\/>%T Orometric Methods in Bounded Metric Data<br\/>%U https:\/\/doi.org\/10.1007\/978-3-030-44584-3\\_39<br\/>%V 12080<br\/><\/p><\/div><\/div><\/li><li class=\"bibsonomycsl_pubitem\"><div class=\"bibsonomycsl_entry\"><div class=\"csl-bib-body\">\n  <div class=\"csl-entry\"><div class=\"csl-left-margin\"><span style=\"display: none;\">1.<\/span><\/div><div class=\"csl-right-inline\"><span class=\"csl-author\">D\u00fcrrschnabel, D., Hanika, T., Stubbemann, M.: <\/span><span class=\"csl-title\"><span class=\"csl-title\">FCA2VEC: Embedding Techniques for Formal Concept Analysis<\/span><\/span>, http:\/\/arxiv.org\/abs\/1911.11496, (2019).<\/div><\/div>\n<\/div><span class=\"bibsonomycsl_url\"><a href=\"http:\/\/arxiv.org\/abs\/1911.11496\" target=\"_blank\">URL<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_bibtex\"><a rel=\"bib-bc5ab586fced13373155352a704a42fb\" href=\"#\">BibTeX<\/a><\/span><span class=\"bibsonomycsl_export bibsonomycsl_endnote\"><a rel=\"end-bc5ab586fced13373155352a704a42fb\" href=\"#\">EndNote<\/a><\/span><div style=\"clear: left\"> <\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_abstract\" style=\"display:none;\" id=\"abs-bc5ab586fced13373155352a704a42fb\">Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding latent semantic analysis recent approaches like word2vec or node2vec are well established tools in this realm. In the present paper we add to this line of research by introducing fca2vec, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computational feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.<\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_bibtex\" style=\"display:none;\" id=\"bib-bc5ab586fced13373155352a704a42fb\"><p>@misc{durrschnabel2019fca2vec,<br\/>  abstract = {Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding latent semantic analysis recent approaches like word2vec or node2vec are well established tools in this realm. In the present paper we add to this line of research by introducing fca2vec, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computational feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.},<br\/>  author = {D\u00fcrrschnabel, Dominik and Hanika, Tom and Stubbemann, Maximilian},<br\/>  keywords = {itegpub},<br\/>  note = {cite arxiv:1911.11496Comment: 25 pages},<br\/>  title = {FCA2VEC: Embedding Techniques for Formal Concept Analysis},<br\/>  year = 2019<br\/>}<br\/><\/p><\/div><div class=\"bibsonomycsl_collapse bibsonomycsl_pub_endnote\" style=\"display:none;\" id=\"end-bc5ab586fced13373155352a704a42fb\"><p>%0 Generic<br\/>%1 durrschnabel2019fca2vec<br\/>%A D\u00fcrrschnabel, Dominik<br\/>%A Hanika, Tom<br\/>%A Stubbemann, Maximilian<br\/>%D 2019<br\/>%T FCA2VEC: Embedding Techniques for Formal Concept Analysis<br\/>%U http:\/\/arxiv.org\/abs\/1911.11496<br\/>%X Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding latent semantic analysis recent approaches like word2vec or node2vec are well established tools in this realm. In the present paper we add to this line of research by introducing fca2vec, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computational feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.<br\/><\/p><\/div><\/div><\/li><\/ul><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Talks<\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li>September 2023: <em>The Mont Blanc of&nbsp;Twitter: Identifying Hierarchies of&nbsp;Outstanding Peaks in&nbsp;Social Networks<\/em>, ECML\/PKDD 2023<\/li>\n\n\n\n<li>September 2023: <em>Intrinsic Dimensionality and Graph Learning, <\/em>ISI Foundation Turin<\/li>\n\n\n\n<li>January 2023: <em>Topological Data Analysis and Neural Networks<\/em>, Dagstuhl Workshop on Concept Lattice Based Topological Data Analysis and Reasoning<\/li>\n\n\n\n<li>April 2022: <em>LG4AV: Combining Language Models and Graph Neural Networks for Author Verification<\/em>, Symposium on Intelligent Data Analysis 2022<\/li>\n\n\n\n<li>July 2021: <em>Dimensionen von N\u00e4he und ihr Einfluss auf die Entstehung von Kollaboration<\/em>, ITeG Brown Bag Seminar<\/li>\n\n\n\n<li>April 2020: <em>Orometric Methods in Bounded Metric Data<\/em>, Symposium on Intelligent Data Analysis 2020<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Research Communication<\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Podcast of the University of Kassel: <a href=\"https:\/\/www.uni-kassel.de\/uni\/aktuelles\/meldung\/2023\/02\/21\/so-funktioniert-chatgpt?cHash=4ea04341b87e56c93d90bef609559ead\">How does ChatGpt work? (In German<\/a>)<\/li>\n\n\n\n<li>Participation in a radio report on ChatGPT on the Hessian radio station HR4, broadcast on 23.03.2023 at 11:15 a.m.<\/li>\n\n\n\n<li>Inteview for the local magazine <a href=\"https:\/\/mein-kassel.com\/e-paper#dearflip-df_5516\/11\/\">&#8220;Mein Kassel&#8221; (in German)<\/a> :<br>Current state and future of language models and their implications four our lives. <\/li>\n\n\n\n<li>External Lecturer at the Departement of Empirical Research on Schools and Teaching:<br>ChatGPT et al. : Functionality, History and Consequences for Teaching (in German)<br><\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Reviewing<\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Subreviewer: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, September 19-23 2022<\/li>\n\n\n\n<li>Subreviewer: 20th International Semantic Web Conference,  October 24-28 2021, Virtual Conference<\/li>\n\n\n\n<li>Subreviewer: 18th Extended Semantic Web Conference, June 6-10 2021, Hersonissos, Greece<\/li>\n\n\n\n<li>Subreviewer: 19th International Semantic Web Conference, November 1-6 2020, Virtual Conference<\/li>\n\n\n\n<li>Subreviewer: 25th International Conference on Conceptual Structures, September 18-21 2020, Bolzano, Italy<\/li>\n\n\n\n<li>Subreviewer: 24th European Conference on Artificial Intelligence, June 8-12 2020, Santiago de Compostela, Spain<\/li>\n\n\n\n<li>Subreviewer: 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 4 &#8211; 8, 2019, Anchorage, Alaska &#8211; USA<\/li>\n\n\n\n<li>Subreviewer: <span xmlns:v=\"http:\/\/rdf.data-vocabulary.org\/#\" typeof=\"v:Event\"><span property=\"v:description\">15th International Conference on Formal Concept Analysis, June 25-28 2019 &#8211; Frankfurt, Germany&nbsp;<\/span><\/span><\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Projects<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\"><a href=\"http:\/\/regio-project.org\" data-type=\"URL\" data-id=\"regio-project.org\">REGIO (2018-2021)<\/a><\/h3>\n\n\n\n<p>REGIO was a joint project between the University of Kassel, the L3S Research Center Hannover, the HU Berlin and the University of W\u00fcrzburg. Together, we established new methodologies and data sources which helped to better understand the impact of geographic and thematic proximity on the genesis and the success of interaction in science and R&amp;D. Our findings indicate, <a href=\"https:\/\/link.springer.com\/article\/10.1007\/s11192-021-03922-1\" data-type=\"URL\" data-id=\"https:\/\/link.springer.com\/article\/10.1007\/s11192-021-03922-1\">that social and thematic proximity are the key factors for cooperation<\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><a href=\"https:\/\/www.uni-kassel.de\/uni\/aktuelles\/meldung\/2021\/12\/3\/loewe-foerderung-fluch-der-dimension-bei-maschinellen-lernverfahren?cHash=3eec618d68af11955cda64c31b5d89a7\" data-type=\"URL\" data-id=\"https:\/\/www.uni-kassel.de\/uni\/aktuelles\/meldung\/2021\/12\/3\/loewe-foerderung-fluch-der-dimension-bei-maschinellen-lernverfahren?cHash=3eec618d68af11955cda64c31b5d89a7\">Dimension Curse Detector (2022-)<\/a><\/h3>\n\n\n\n<p>This project is led by <a href=\"https:\/\/www.kde.cs.uni-kassel.de\/hanika\" data-type=\"page\" data-id=\"137\">Dr. Tom Hanika<\/a> within the <a href=\"https:\/\/wissenschaft.hessen.de\/Forschen\/Landesprogramm-LOEWE\/LOEWE-Exploration\">Loewe Exploration program.<\/a> In this project, we investigate to which extent machine learning is influenced by the intrinsic dimensionality of data. For this, we quantify the Curse of Dimensionality which is strongly connected to the phenomenon of measure concentration. Within the project, we develop methods that allow to efficiently compute the intrinsic dimension of modern large-scale datasets.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Teaching<\/h2>\n\n\n\n<p>Together with <a href=\"https:\/\/www.kde.cs.uni-kassel.de\/hirth\">Johannes Hirth<\/a>, I developed a Clojure programming curse which first took place in summer 2019. The course is hold until today on a regular basis.<\/p>\n\n\n\n<p>Wintersemester 2022<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ws2022-23\/kdd_praktikum\" data-type=\"page\" data-id=\"8576\">Knowledge Discovery (Praktikum)<\/a><\/li>\n<\/ul>\n\n\n\n<p>Sommersemester 2021<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ss2022\/datenbanken\" data-type=\"page\" data-id=\"8256\">Datenbanken<\/a><\/li>\n<\/ul>\n\n\n\n<p>Wintersemester 2021<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ws2020-21\/kdd_praktikum\" data-type=\"page\" data-id=\"7257\">Knowledge Discovery (Praktikum)<\/a><\/li>\n<\/ul>\n\n\n\n<p>Sommersemester 2020<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ss2020\/datenbanken\">Datenbanken<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ss2020\/ausgewaehlte-themen-der-wissensverarbeitung-seminar\">Seminar: Ausgew\u00e4hlte Themen der Wissensverarbeitung<\/a><\/li>\n<\/ul>\n\n\n\n<p>Winter 2019\/20<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ws2019-20\/sna\">Soziale Netzwerkanalyse<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ws2019-20\/ln\">Labor Netzwerke<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ws2019-20\/clojurekurs\">Clojure-Kurs<\/a><\/li>\n<\/ul>\n\n\n\n<p>Summer 2019<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ss2019\/datenbanken\">Datenbanken<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ss2019\/clojurekurs\">Clojure-Kurs<\/a><\/li>\n<\/ul>\n\n\n\n<p>Winter 2018\/2019<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.kde.cs.uni-kassel.de\/lehre\/ws2018-19\/kdd_praktikum\">Knowledge Discovery (Praktikum)<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>I have left the Knowledge and Data Engineering Group on 31.10.2023. I am now working at the Information Systems and Machine Learning Lab of the University of Hildesheim. My new homepage is available here.Email: stubbemann@cs.uni-kassel.de<a class=\"moretag\" href=\"https:\/\/www.kde.cs.uni-kassel.de\/en\/stubbemann\"> Read more&hellip;<\/a><\/p>\n","protected":false},"author":15,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-4645","page","type-page","status-publish","hentry"],"translation":{"provider":"WPGlobus","version":"3.0.2","language":"en","enabled_languages":["de","en"],"languages":{"de":{"title":true,"content":true,"excerpt":false},"en":{"title":false,"content":false,"excerpt":false}}},"_links":{"self":[{"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/pages\/4645","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/users\/15"}],"replies":[{"embeddable":true,"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/comments?post=4645"}],"version-history":[{"count":43,"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/pages\/4645\/revisions"}],"predecessor-version":[{"id":9522,"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/pages\/4645\/revisions\/9522"}],"wp:attachment":[{"href":"https:\/\/www.kde.cs.uni-kassel.de\/en\/wp-json\/wp\/v2\/media?parent=4645"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}