The Uniform Minimum-Ones 2SAT Problem and its Application to Haplotype Classification

Hans-Joachim Böckenhauer; Michal Forišek; Ján Oravec; Björn Steffen; Kathleen Steinhöfel; Monika Steinová

RAIRO - Theoretical Informatics and Applications (2010)

  • Volume: 44, Issue: 3, page 363-377
  • ISSN: 0988-3754

Abstract

top
Analyzing genomic data for finding those gene variations which are responsible for hereditary diseases is one of the great challenges in modern bioinformatics. In many living beings (including the human), every gene is present in two copies, inherited from the two parents, the so-called haplotypes. In this paper, we propose a simple combinatorial model for classifying the set of haplotypes in a population according to their responsibility for a certain genetic disease. This model is based on the minimum-ones 2SAT problem with uniform clauses. The minimum-ones 2SAT problem asks for a satisfying assignment to a satisfiable formula in 2CNF which sets a minimum number of variables to true. This problem is well-known to be 𝒩𝒫 -hard, even in the case where all clauses are uniform, i.e., do not contain a positive and a negative literal. We analyze the approximability and present the first non-trivial exact algorithm for the uniform minimum-ones 2SAT problem with a running time of 𝒪 (1.21061n) on a 2SAT formula with n variables. We also show that the problem is fixed-parameter tractable by showing that our algorithm can be adapted to verify in 𝒪 * (2k) time whether an assignment with at most k true variables exists.

How to cite

top

Böckenhauer, Hans-Joachim, et al. "The Uniform Minimum-Ones 2SAT Problem and its Application to Haplotype Classification." RAIRO - Theoretical Informatics and Applications 44.3 (2010): 363-377. <http://eudml.org/doc/250809>.

@article{Böckenhauer2010,
abstract = { Analyzing genomic data for finding those gene variations which are responsible for hereditary diseases is one of the great challenges in modern bioinformatics. In many living beings (including the human), every gene is present in two copies, inherited from the two parents, the so-called haplotypes. In this paper, we propose a simple combinatorial model for classifying the set of haplotypes in a population according to their responsibility for a certain genetic disease. This model is based on the minimum-ones 2SAT problem with uniform clauses. The minimum-ones 2SAT problem asks for a satisfying assignment to a satisfiable formula in 2CNF which sets a minimum number of variables to true. This problem is well-known to be $\mathcal\{NP\}$-hard, even in the case where all clauses are uniform, i.e., do not contain a positive and a negative literal. We analyze the approximability and present the first non-trivial exact algorithm for the uniform minimum-ones 2SAT problem with a running time of $\mathcal\{O\}$(1.21061n) on a 2SAT formula with n variables. We also show that the problem is fixed-parameter tractable by showing that our algorithm can be adapted to verify in $\mathcal\{O\}^*$(2k) time whether an assignment with at most k true variables exists. },
author = {Böckenhauer, Hans-Joachim, Forišek, Michal, Oravec, Ján, Steffen, Björn, Steinhöfel, Kathleen, Steinová, Monika},
journal = {RAIRO - Theoretical Informatics and Applications},
keywords = {Exact algorithms; fixed-parameter algorithms; minimum-ones 2SAT; haplotypes; exact algorithms},
language = {eng},
month = {10},
number = {3},
pages = {363-377},
publisher = {EDP Sciences},
title = {The Uniform Minimum-Ones 2SAT Problem and its Application to Haplotype Classification},
url = {http://eudml.org/doc/250809},
volume = {44},
year = {2010},
}

TY - JOUR
AU - Böckenhauer, Hans-Joachim
AU - Forišek, Michal
AU - Oravec, Ján
AU - Steffen, Björn
AU - Steinhöfel, Kathleen
AU - Steinová, Monika
TI - The Uniform Minimum-Ones 2SAT Problem and its Application to Haplotype Classification
JO - RAIRO - Theoretical Informatics and Applications
DA - 2010/10//
PB - EDP Sciences
VL - 44
IS - 3
SP - 363
EP - 377
AB - Analyzing genomic data for finding those gene variations which are responsible for hereditary diseases is one of the great challenges in modern bioinformatics. In many living beings (including the human), every gene is present in two copies, inherited from the two parents, the so-called haplotypes. In this paper, we propose a simple combinatorial model for classifying the set of haplotypes in a population according to their responsibility for a certain genetic disease. This model is based on the minimum-ones 2SAT problem with uniform clauses. The minimum-ones 2SAT problem asks for a satisfying assignment to a satisfiable formula in 2CNF which sets a minimum number of variables to true. This problem is well-known to be $\mathcal{NP}$-hard, even in the case where all clauses are uniform, i.e., do not contain a positive and a negative literal. We analyze the approximability and present the first non-trivial exact algorithm for the uniform minimum-ones 2SAT problem with a running time of $\mathcal{O}$(1.21061n) on a 2SAT formula with n variables. We also show that the problem is fixed-parameter tractable by showing that our algorithm can be adapted to verify in $\mathcal{O}^*$(2k) time whether an assignment with at most k true variables exists.
LA - eng
KW - Exact algorithms; fixed-parameter algorithms; minimum-ones 2SAT; haplotypes; exact algorithms
UR - http://eudml.org/doc/250809
ER -

References

top
  1. B. Aspvall, M.F. Plass and R.E. Tarjan, A linear-time algorithm for testing the truth of certain quantified boolean formulas. Inf. Proc. Lett.8 (1979) 121–123.  
  2. H.-J. Böckenhauer and D. Bongartz, Algorithmic Aspects of Bioinformatics. Natural Computing Series, Springer-Verlag (2007).  
  3. P. Bonizzoni, G.D. Vedova, R. Dondi and J. Li, The haplotyping problem: An overview of computational models and solutions. J. Comput. Sci. Technol.18 (2003) 675–688.  
  4. J. Chen, I. Kanj and W. Jia, Vertex cover: further observations and further improvements. J. Algorithms41 (2001) 280–301.  
  5. I. Dinur and S. Safra, On the hardness of approximating minimum vertex cover. Ann. Math.162 (2005) 439–485.  
  6. D. Gusfield and L. Pitt, A bounded approximation for the minimum cost 2-sat problem. Algorithmica8 (1992) 103–117.  
  7. J. Hromkovič, Algorithmics for Hard Problems. Introduction to Combinatorial Optimization, Randomization, Approximation, and Heuristics. Texts in Theoretical Computer Science, An EATCS Series, Springer-Verlag, Berlin (2003).  
  8. G. Karakostas, A better approximation ratio for the vertex cover problem. Technical Report TR04-084, ECCC (2004).  
  9. S. Khot and O. Regev, Vertex cover might be hard to approximate to within 2-epsilon. J. Comput. Syst. Sci.74 (2008) 335–349.  
  10. J. Kiniwa, Approximation of self-stabilizing vertex cover less than 2, in Self Stabilizing Systems (2005) 171–182.  
  11. E.L. Lawler and D.E. Wood, Branch-and-bound methods: A survey. Operat. Res.14 (1966) 699–719.  
  12. J. Li and T. Jiang, A survey on haplotyping algorithms for tightly linked markers. J. Bioinf. Comput. Biol.6 (2008) 241–259.  
  13. J.M. Robson, Finding a maximum independent set in timeO(2n/4). Technical Report 1251-01, LaBRI, Université Bordeaux I (2001).  

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.