The Mutual Information in the Vicinity of Capacity-Achieving Input Distributions


NAKİBOĞLU B., Cheng H.

IEEE Transactions on Information Theory, vol.71, no.8, pp.5771-5787, 2025 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 71 Issue: 8
  • Publication Date: 2025
  • Doi Number: 10.1109/tit.2025.3562098
  • Journal Name: IEEE Transactions on Information Theory
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Aerospace Database, Applied Science & Technology Source, Business Source Elite, Business Source Premier, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC, MathSciNet, Metadex, zbMATH, Civil Engineering Abstracts
  • Page Numbers: pp.5771-5787
  • Keywords: Fisher Information, Moreaus decomposition theorem, Mutual information, polyhedral convexity, Shannon center, Taylors theorem
  • Middle East Technical University Affiliated: Yes

Abstract

The mutual information is bounded from above by a decreasing affine function of the square of the distance between the input distribution and the set of all capacity-achieving input distributions ΠA, on small enough neighborhoods of ΠA, using an identity due to Topsoe and the Pinsker's inequality, assuming that the input set of the channel is finite and the constraint set A is polyhedral, i.e., can be described by (possibly multiple but) finitely many linear constraints. Counterexamples demonstrating nonexistence of such a quadratic bound are provided for the case of infinitely many linear constraints and the case of infinite input sets. Using Taylor's theorem with the remainder term, rather than the Pinsker's inequality and invoking Moreau's decomposition theorem the exact characterization of the slowest decrease of the mutual information with the distance to ΠA is determined on small neighborhoods of ΠA. Corresponding results for classical-quantum channels are established under separable output Hilbert space assumption for the quadratic bound and under finite-dimensional output Hilbert space assumption for the exact characterization. Implications of these observations for the channel coding problem and applications of the proof techniques to related problems are discussed.