dr hab. Łukasz Dębowski

  1. Ł. Dębowski, (2013). Information Theory and Statistics. Institute of Computer Science, Polish Academy of Sciences. pdf
Chapters in monographs
  1. E. Charzyńska, Ł. Dębowski, W. Gruszczyński, M. Hadryan, (2015). Historia badań nad zrozumiałością tekstu. In: W. Gruszczyński, M. Ogrodniczuk, eds., Jasnopis, czyli mierzenie zrozumiałości polskich tekstów użytkowych. Warszawa: Oficyna Wydawnicza ASPRA-JR. (pp. 11–38)
  2. Ł. Dębowski, (2015). Konstrukcja nowych formuł analitycznych. In: W. Gruszczyński, M. Ogrodniczuk, eds., Jasnopis, czyli mierzenie zrozumiałości polskich tekstów użytkowych. Warszawa: Oficyna Wydawnicza ASPRA-JR. (pp. 109–126)
Journal articles
  1. R. Takahira, K. Tanaka-Ishii, Ł. Dębowski, (2016). Entropy Rate Estimates for Natural Language–A New Extrapolation of Compressed Large-Scale Corpora. Entropy, vol. 18(10), article 364. pdf
  2. Ł. Dębowski, (2015). The Relaxed Hilberg Conjecture: A Review and New Experimental Support. Journal of Quantitative Linguistics, vol. 22, pp. 311–337. pdf
  3. E. Charzyńska, Ł. Dębowski, (2015). Empirical verification of the Polish formula of text difficulty. Cognitive Studies / Études cognitives, vol. 15, pp. 125–132. pdf
  4. Ł. Dębowski, (2015). A Preadapted Universal Switch Distribution for Testing Hilberg's Conjecture. IEEE Transactions on Information Theory, vol. 61, pp. 5708–5715. arXiv
  5. Ł. Dębowski, (2015). Hilberg Exponents: New Measures of Long Memory in the Process. IEEE Transactions on Information Theory, vol. 61, pp. 5716–5726. arXiv
  6. Ł. Dębowski, (2015). Maximal Repetitions in Written Texts: Finite Energy Hypothesis vs. Strong Hilberg Conjecture. Entropy, vol. 17, pp. 5903–5919. pdf
  7. Ł. Dębowski, (2014). Hilberg's Conjecture—a Challenge for Machine Learning. Schedae Informaticae, vol. 23, pp. 33-44. pdf
  8. R. Ferrer-i-Cancho, A. Hernández-Fernández, J. Baixeries, Ł. Dębowski, J. Mačutek, (2014). When is Menzerath-Altmann law mathematically trivial? A new approach. Statistical Applications in Genetics and Molecular Biology, vol. 13, pp. 633-644. pdf
  9. Ł. Dębowski, (2014). On Hidden Markov Processes with Infinite Excess Entropy. Journal of Theoretical Probability, vol. 27, pp. 539-551. arXiv
  10. R. Ferrer-i-Cancho, Ł. Dębowski, F. Moscoso del Prado Martin, (2013). Constant conditional entropy and related hypotheses. Journal of Statistical Mechanics: Theory and Experiment, L07001. pdf
  11. Ł. Dębowski, (2012). On Bounded Redundancy of Universal Codes. Statistics & Probability Letters, vol. 82, pp. 2068-2071. pdf
  12. Ł. Dębowski, (2012). Mixing, Ergodic, and Nonergodic Processes with Rapidly Growing Information between Blocks. IEEE Transactions on Information Theory, vol. 58, pp. 3392-3401. arXiv
  13. Ł. Dębowski, (2011). Excess entropy in natural language: present state and perspectives. Chaos, vol. 21, article 037105 (11 pages). arXiv
  14. Ł. Dębowski, (2011). On processes with hyperbolically decaying autocorrelations. Journal of Time Series Analysis, vol. 32, pp. 580-584. pdf
  15. Ł. Dębowski, (2011). On the Vocabulary of Grammar-Based Codes and the Logical Consistency of Texts. IEEE Transactions on Information Theory, vol. 57, pp. 4589-4599. arXiv
  16. Ł. Dębowski, (2010). Variable-Length Coding of Two-Sided Asymptotically Mean Stationary Measures. Journal of Theoretical Probability, vol. 23, pp. 237-256. arXiv
  17. Ł. Dębowski, (2009). Valence extraction using EM selection and co-occurrence matrices. Language Resources and Evaluation, vol. 43, pp. 301-327. arXiv
  18. Ł. Dębowski, (2009). A general definition of conditional information and its application to ergodic decomposition. Statistics & Probability Letters, vol. 79, pp. 1260-1268. pdf
  19. Ł. Dębowski, (2007). On processes with summable partial autocorrelations. Statistics & Probability Letters, vol. 77, pp. 752-759. pdf
  20. Ł. Dębowski, (2006). On Hilberg's Law and Its Links with Guiraud's Law. Journal of Quantitative Linguistics, vol. 13, no. 1, pp. 81-109. arXiv
  21. A. Przepiórkowski, P. Bański, Ł. Dębowski, E. Hajnicz, M. Woliński, (2005). The IPI PAN Corpus. Annual Report of the Polish Academy of Sciences, pp. 54-55.
  22. A. Przepiórkowski, P. Bański, Ł. Dębowski, E. Hajnicz, M. Woliński, (2003). Konstrukcja korpusu IPI PAN. Polonica, vol. XXII-XXIII, pp. 33-38. pdf
  23. Ł. Dębowski, (2002). Zipf's law against the text size: A half-rational model. Glottometrics, vol. 4, pp. 49-60 (Special volume: To honor G. K. Zipf). pdf
  24. Ł. Dębowski, J. Hajič, V. Kuboň, (2002). Testing the limits—Adding a new language to an MT system. The Prague Bulletin of Mathematical Linguistics, vol. 78, pp. 91-101.
Articles in conference proceedings
  1. Ł. Dębowski, (2016). Consistency of the Plug-In Estimator of the Entropy Rate for Ergodic Processes. In: 2016 IEEE International Symposium on Information Theory (ISIT). (pp. 1651–1655) arXiv
  2. Ł. Dębowski, B. Broda, B. Nitoń, E. Charzyńska, (2015). Jasnopis — A Program to Compute Readability of Texts in Polish Based on Psycholinguistic Research. In: B. Sharp, W. Lubaszewski and R. Delmonte, eds., Natural Language Processing and Cognitive Science. Proceedings 2015. Libreria Editrice Cafoscarina. (pp. 51–61) pdf
  3. Ł. Dębowski, (2015). A New Universal Code Helps to Distinguish Natural Language from Random Texts. In: A. Tuzzi, M. Benesova, J. Mačutek, eds., Recent Contributions to Quantitative Linguistics. Berlin: De Gruyter Mouton. (pp. 41–50) pdf
  4. Ł. Dębowski, (2015). Regular Hilberg Processes: Nonexistence of Universal Redundancy Ratios. In: J. Rissanen, P. Harremoës, S. Forchhammer, T. Roos and P. Myllymäki, eds., Proceedings of the Eighth Workshop on Information Theoretic Methods in Science and Engineering. University of Helsinki, Department of Computer Science. Series of Publications B, Report B-2015-1. (pp. 7–10) pdf
  5. Ł. Dębowski, (2013). Empirical Evidence for Hilberg's Conjecture in Single-Author Texts. In: I. Obradović, E. Kelih, R. Köhler, eds., Methods and Applications of Quantitative Linguistics—Selected papers of the 8th International Conference on Quantitative Linguistics (QUALICO). Belgrade: Academic Mind. (pp. 143-151) pdf
  6. Ł. Dębowski, (2012). Information-theoretic models of natural language. In: The Fifth Workshop on Information Theoretic Methods in Science and Engineering, WITMSE 2012, 27-30 August 2012, Amsterdam, Netherlands. (3 pages) pdf
  7. Ł. Dębowski, (2010). A link between the number of set phrases in a text and the number of described facts. In: P. Grzybek, E. Kelih, J. Mačutek, eds., Text and Language: Structures—Functions—Interrelations. Quantitative Perspectives. Wien: Praesens Verlag. (pp. 31-36)
  8. Ł. Dębowski, (2009). Computable Bayesian Compression for Uniformly Discretizable Statistical Models. In: R. Gavalda et al., eds., Proceedings of the 20th International Conference on Algorithmic Learning Theory, ALT 2009, Porto, Portugal, October 3-5, LNAI 5809. (pp. 53-67) pdf
    (The online version contains a few later corrections.)
  9. Ł. Dębowski, M. Woliński, (2007). Argument co-occurrence matrix as a description of verb valence. In: 3rd Language & Technology Conference, October 5-7, 2007, Poznań, Poland. (pp. 260-264) pdf
  10. Ł. Dębowski, (2007). On vocabulary size of grammar-based codes. In: 2007 IEEE International Symposium on Information Theory. Nice, France, July 25-29. (pp. 91-95) arXiv
  11. Ł. Dębowski, (2006). Excess entropy—a link between probabilistic and algorithmic approaches to mutual information. In: Twenty-seventh Symposium on Information Theory in the Benelux. Noordwijk, The Netherlands, June 8-9, 2006. (pp. 141-148)
  12. Ł. Dębowski, (2004). Entropic Subextensivity in Language and Learning. In: C. Tsallis, M. Gell-Mann, eds., Nonextensive Entropy—Interdisciplinary Applications. Oxford University Press. (pp. 335-345)
  13. A. Przepiórkowski, Z. Krynicki, Ł. Dębowski, M. Woliński, D. Janus, P. Bański, (2004). A Search Tool for Corpora with Positional Tagsets and Ambiguities. In: Proceedings of the Fourth International Conference on Language Resources and Evaluation, LREC 2004. (pp. 1235-1238) pdf
  14. Ł. Dębowski, (2004). Trigram morphosyntactic tagger for Polish. In: M. A. Kłopotek, S. T. Wierzchoń, K. Trojanowski, eds., Intelligent Information Processing and Web Mining. Proceedings of the International IIS:IIPWM'04 Conference held in Zakopane, Poland, May 17-20, 2004. Springer. (pp. 409-413) pdf
  15. Ł. Dębowski, (2003). A reconfigurable stochastic tagger for languages with complex tag structure. In: Proceedings of the Workshop on Morphological Processing of Slavic Languages. 10th Conference of the European Chapter of the Association for Computational Linguistics. Budapest 2003. (pp. 63-70) pdf
  16. P. Bański, A. Przepiórkowski, A. Kupść, Ł. Dębowski, M. Marciniak, A. Mykowiecka, (2003). The Design of the IPI PAN Corpus. In: PALC 2001: Practical Applications in Language Corpora. Peter Lang — Europäischer Verlag der Wissenschaften. (pp. 225-232)
  17. Ł. Dębowski, (2001). A Revision of Coding Theory for Learning from Language. In: Geert-Jan Kruiff, Lawrence S. Moss, Richard T. Oehrle, eds., Proceedings of Formal Grammar/Mathematics of Language Conference. August 10-12, 2001. Helsinki, Finland. Electronic Notes in Theoretical Computer Science, vol. 53. Elsevier. (15 pages)
Abstracts in conference proceedings
  1. Ł. Dębowski, R. Takahira, K. Tanaka-Ishii, (2016). Large Scale Entropy Rate Estimation: A New Law that Governs the Complexity of Language. In: Cognitive Systems Modeling: 5th Peripatetic Conference. Zakopane, Poland, 6-8 October 2016. (pp. 12-13)
  2. R. Ferrer-i-Cancho, Ł. Dębowski, (2013). Constant entropy rate and related hypotheses versus real language. In: 35th Annual Conference of the Cognitive Science Society. Berlin, Germany.
  3. Ł. Dębowski, (2006). The ergodic decomposition of excess entropy. In: XXVI European Meeting of Statisticians. Toruń, 24th-28th July 2006. (p. 36)
  4. Ł. Dębowski, (2005). Measure-theoretic information theory and its application to universal coding. In: Warsaw Probability Meeting. Warszawa, 15th-16th December 2005.
  5. Ł. Dębowski, (2005). Processes with absolutely summable partial autocorrelations. In: Proceedings of the 14th European Young Statisticians Meeting. Debrecen, 22nd-26th August 2005. (p. 19)
  6. Ł. Dębowski, (2005). Excess entropy, ergodic decomposition, and universal codes. In: Proceedings of the 30th Conference on Stochastic Processes and their Applications. University of California Santa Barbara, Santa Barbara, 26th June-1st July 2005. (p. 28)
  7. Ł. Dębowski, (2004). On the sum of autocorrelations of a process with absolutely summable partial autocorrelations. In: Proceedings of the 6th World Congress of the Bernoulli Society and the 67th Annual Meeting of the Institute of Mathematical Statistics. Barcelona, 26th-31st July 2004. (pp. 88-89)
  8. Ł. Dębowski, (2004). O sumie autokorelacji dla procesu stacjonarnego o bezwzględnie sumowalnej funkcji częściowej autokorelacji. In: VIII Konferencja z Probabilistyki. Będlewo, 17-21 maja 2004. Materiały konferencyjne. (pp. 21-22)
  9. Ł. Dębowski, (2003). The role of Hilberg's law in statistical language modeling. In: 4th Trier Colloquium on Quantitative Linguistics, Trier, 16th-18th October 2003.
Contributions to Festschrifts
  1. Ł. Dębowski, (2016). Estimation of entropy from subword complexity. In: S. Matwin, J. Mielniczuk, eds., Challenges in Computational Statistics and Data Mining. New York: Springer. (pp. 53-70) pdf
  2. Ł. Dębowski, (2012). Maximal Lengths of Repeat in English Prose. In: S. Naumann, P. Grzybek, R. Vulanović and G. Altmann, eds., Synergetic Linguistics. Text and Language as Dynamic System. Wien: Praesens Verlag. (pp. 23-30) pdf
  3. Ł. Dębowski, (2007). Menzerath's law for the smallest grammars. In: P. Grzybek, R. Köhler, eds., Exact Methods in the Study of Language and Text. Mouton de Gruyter. (pp. 77-85) pdf
Technical reports
  1. E. Hajnicz, Ł. Dębowski, M. Wiech, (2007). Przykładowe zastosowanie gradacyjnej analizy danych w badaniach lingwistycznych. (An example of application of grade data analysis in linguistic research.) IPI PAN Reports, nr 1005.
  2. Ł. Dębowski, (2006). Ergodic decomposition of excess entropy and conditional mutual information. IPI PAN Reports, nr 993. pdf
  3. Ł. Dębowski, (2001). Tagowanie i dezambiguacja morfosyntaktyczna. Przegląd metod i oprogramowania. (Tagging and morphosyntactic disambiguation. A review of methods and software.) IPI PAN Reports, nr 934. pdf
Theses
  1. Ł. Dębowski, (2005). Własności entropii nadwyżkowej dla procesów stochastycznych nad różnymi alfabetami. (Excess entropy for stochastic processes over various alphabets.) PhD thesis, Institute of Computer Science, Polish Academy of Sciences. pdf
  2. Ł. Dębowski, (1999). Teoria funkcjonału gęstości dla monowarstw zaadsorbowanych na podłożu krystalicznym. (Density functional theory for monolayers adsorbed on a crystalline surface.) Master's thesis, Faculty of Physics, Warsaw University. pdf