Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints

Text generation from a knowledge base aims to translate knowledge triples to natural language descriptions. Most existing methods ignore the faithfulness between a generated text description and the original table, leading to generated information that goes beyond the content of the table. In this paper, for the first time, we propose a novel Transformer-based generation framework to achieve the goal. The core techniques in our method to enforce faithfulness include a new table-text optimal-transport matching loss and a table-text embedding similarity loss based on the Transformer model. Furthermore, to evaluate faithfulness, we propose a new automatic metric specialized to the table-to-text generation problem. We also provide detailed analysis on each component of our model in our experiments. Automatic and human evaluations show that our framework can significantly outperform state-of-the-art by a large margin.


Introduction
Understanding structured knowledge, e.g., information encoded in tables, and automatically generating natural-language descriptions is an important task in the area of Natural Language Generation. Table-to-text generation helps making knowledge elements and their connections in tables easier to comprehend by human. There have been a number of practical application scenarios in this field, for example, weather report generation, NBA news generation, biography generation and medical-record description generation (Liang et al., 2009;Barzilay and Lapata, 2005;Lebret et al., 2016a;Cawsey et al., 1997).
Most existing methods for table-to-text generation are based on an encoder-decoder framework (Sutskever et al., 2014;Bahdanau et al., Figure 1: An example of table-to-text generation. This generation is unfaithful because there exists information in table not covered by generated text (marked in blue); At the same time, hallucinated information in text does not appear in table (marked in red). 2015), most of which are RNN-based Sequenceto-Sequence (Seq2Seq) models (Lebret et al., 2016b;Liu et al., 2018;Wiseman et al., 2018;Ma et al., 2019;Liu et al., 2019a). Though significant progress has been achieved, we advocate two key problems in existing methods. Firstly, because of the intrinsic shortage of RNN, RNN-based models are not able to capture longterm dependencies, which would lose important information reflected in a table. This drawback prevents them from being applied to larger tables, for example, a table describing a large Knowledge Base . Secondly, little work has focused on generating faithful text descriptions, which is defined, in this paper, as the level of matching between a generated text sequence and the corresponding table content. An unfaithful generation example is illustrated in Figure 1. The training objectives and evaluation metrics of existing methods encourage generating texts to be as similar as possible to reference texts. One problem with this is that the reference text often contains extra information that is not presented in the table because human beings have external knowledge beyond the input table when writing the text, or it even misses some important information in the table (Dhingra et al., 2019) due to the noise from the dataset collection process. As a result, unconstrained training with such mis-matching information usually leads to hallucinated words or phrases in generated texts, making them unfaithful to the table and thus harmful in practical uses.
In this paper, we aim to overcome the above problems to automatically generate faithful texts from tables. In other words, we aim to produce the writing that a human without any external knowledge would do given the same table data as input. In contrast to existing RNN-based models, we leverage the powerful attention-based Transformer model to capture long-term dependencies and generate more informative paragraphlevel texts. To generate descriptions faithful to tables, two content-matching constraints are proposed. The first one is a latent-representationlevel matching constraint encouraging the latent semantics of the whole text to be consistent with that of the whole table. The second one is an explicit entity-level matching scheme, which utilizes Optimal-Transport (OT) techniques to constrain key words of a table and the corresponding text to be as identical as possible. To evaluate the faithfulness, we also propose a new PARENT-T metric evaluating the content matching between texts and tables, based on the recently proposed PARENT (Dhingra et al., 2019) metric. We train and evaluate our model on a large-scale knowledge base dataset . Automatic and human evaluations both show that our method achieve the state-of-the-art performance, and can generates paragraph-level descriptions much more informative and faithful to input tables.

The Proposed Method
The task of text generation for a knowledge base is to take the structured table, T = {(t 1 , v 1 ), (t 2 , v 2 ), , (t m , v m )}, as input, and outputs a natural-language description consisting of a Figure 2: The architecture of our proposed model for table-to-text generation. To enhance the ability of generating multi-sentence faithful texts, our loss consists of three parts, including a maximum-likelihood loss (green), a latent matching disagreement loss (orange), and an optimal-transport loss (blue). sequence of words y = {y 1 , y 2 , , y n } that is faithful to the input table. Here, t i denotes the slot type for the i th row, and v i denotes the slot value for the i th row in a table.
Our model adopts the powerful Transformer model (Vaswani et al., 2017) to translate a table to a text sequence. Specifically, the Transformer is a Seq2Seq model, consisting of an encoder and a decoder. Our proposed encoder-to-decoder Transformer model learns to estimate the conditional probability of a text sequence from a source table input in an autoregressive way: where θ is the Transformer parameters and y <i denotes the decoded words from previous steps. Existing models for table-to-text generation either only focus on generating text to match the reference text (Liu et al., 2018;Ma et al., 2019), or only require a generated text sequence to be able to cover the input table . However, as the only input information is the table, the generated text should be faithful to the input table as much as possible. Therefore, we propose two constraint losses, including a tabletext disagreement constraint loss and a constrained content matching loss with optimal transport, to encourage the model to learn to match between the generated text and the input table faithfully. Figure 2 illustrates the overall architecture of our model. In summary, our model loss contains three parts: 1) a maximum likelihood loss (green) that measures the matching between a model prediction and the reference text sequence; 2) a latent feature matching disagreement loss (orange) that measures the disagreement between a table encoding and the corresponding reference-text encoding; and 3) an optimal-transport loss (blue) matching the key words of an input table and the corresponding generated text.

Table Representation
The entities of a table simply consists of Slot Type and Slot Value pairs. To apply the Transformer model, we first linearize input tables into sequences. Slot types and slot values are separated by special tokens "<" and ">". As an example, the table in Figure 1 is converted into a sequence: {< Name ID >, Willie Burden, < date of birth > , July 21 1951, · · · }. We note that encoding a table in this way might lose some high-order structure information presented in the original knowledge graph. However, our knowledge graph is relatively simple. According to our preliminary studies, a naive combination of feature extracted with graph neural networks (Beck et al., 2018) does not seem helpful. As a result, we only rely on the sequence representation in this paper.

The Base Objective
Our base objective comes from the standard Transformer model, which is defined as the negative log-likelihood loss L mle of a target sentence y given its input T , i.e., with P (y|T ; θ) defined in (1).

Faithfulness Modeling with a Table-Text Disagreement Constraint Loss
One key element of our model is to enforce a generated text sequence to be consistent with (or faithful to) the table input. To achieve this, we propose to add some constraints so that a generated text sequence only contains information from the table.
Our first idea is inspired by related work in machine translation . Specifically, we propose to constrain a table embedding to be close to the corresponding target sentence embedding. Since the embedding of a text sequence (or the table) in our model is also represented as a sequence, we propose to match the mean embeddings of both sequences. In fact, the mean embedding has been proved to be an effective representation for the whole sequence in machine translation Wang et al., 2017). Let V table andV text be the mean embeddings of a table and the target text embeddings in our Transformerbased model, respectively. A table-target sentence disagreement loss L disagree is then defined as

Faithfulness Modeling with Constrained Content Matching via Optimal Transport
Our second strategy is to explicitly match the key words in a table and the corresponding generated text. In our case, key words are defined as nouns, which can be easily extracted with existing tools such as NLTK (Loper and Bird, 2002). To match key words, a mis-matching loss should be defined. Such a mis-matching loss could be non-differentiable, e.g., when the loss is defined as the number of matched entities. In order to still be able to learn by gradient descent, one can adopt the policy gradient algorithm to deal with the non-differentiability. However, policy gradient is known to exhibit high variance. To overcome this issue, we instead propose to perform optimization via optimal transport (OT), inspired by the recent techniques in (Chen et al., 2019a).
Optimal-Transport Distance In the context of text generation, a generated text sequence, y = (y 1 , · · · , y n ), can be represented as a discrete distribution µ = n i=1 u i δ y i (·), where u i ≥ 0 and i u i = 1, δ x (·) denotes a spike distribution located at x. Given two discrete distributions µ and ν, written as µ = n i=1 u i δ x i and ν = m j=1 v j δ y j , respectively, the OT distance between µ and ν is defined as the solution of the following maximum network-flow problem: where d(x, y) is the cost of moving x to y (matching x and y). In this paper, we use the cosine distance between the two word-embedding vectors of x and y, defined as d(x, y) = 1 − xy x 2 y 2 . Π(µ, ν) is the set of joint distributions such that the two marginal distributions equal to µ and ν, respectively. Figure 3: Illustration of the OT loss, which is defined with OT distance to only match key words in both the table and the generated sentence. Left: the generated sentence not only contains extra information not presented in the table (shown as orange), but also lacks some information presented in the table (shown as red). This is unfaithful generation. The OT lost is thus high. Right: all information in the table is covered in the generated sentence, and the generated sentence does not contain extra information not presented in the table. This is faithful generation. The OT cost is thus low. This example is borrowed and modified from (Dhingra et al., 2019).
Exact minimization over U in the above problem is in general computational intractable (Genevay et al., 2018). Therefore, we adopt the recently proposed Inexact Proximal point method for Optimal Transport (IPOT) (Xie et al., 2018) as an approximation. The details of the IPOT algorithm are shown in Appendix C.
Constrained Content Matching via OT To apply the OT distance to our setting, we need to first specify the atoms in the discrete distributions. Since nouns typically are more informative, we propose to match the nouns in both an input table and the decoded target sequence. We use NLTK (Loper and Bird, 2002) to extract the nouns that are then used for computing the OT loss. In this way, the computational cost can also be significantly reduced comparing to matching all words.
The OT loss can be used as a metric to measure the goodness of the match between two sequences. To illustrate the motivation of applying the OT loss to our setting, we provide an example illustrated in Figure 3, where we try to match the table with the two generated text sequences. On the left plot, the generated text sequence contains "California brand Grateful Dead", which is not presented in the input table. Similarly, and the phrases "Seattle, Washington" and "Skokie Illinois" in the table are not covered by the generated text. Consequently, the resulting OT loss will be high. By contrast, on the right plot, the table contains all information in the text, and all the phrases in the table are also covered well by the generated text, leading to a low OT loss. As a result, optimizing over the OT loss in (4) would enforce faithful matching be-tween a table and its generated text.
Optimization via OT When optimizing the OT loss with the IPOT algorithm, the gradients of the OT loss is required to be able to propagate back to the Transformer component. In other words, this requires gradients to flow back from a generated sentence. Note that a sentence is generated by sampling from a multinomial distribution, whose parameter is the Transformer decoder output represented as a logit vector S t for each word in the vocabulary. This sampling process is unfortunately non-differentiable. To enable backpropagation, we follow Chen et al. (2019a) and use the Soft-argmax trick to approximate each word with the corresponding soft-max output.
To further reduce the number of parameters and improve the computational efficiency, we adopt the factorized embedding parameterization proposed recently (Lan et al., 2019). Specifically, we decompose a word embedding matrix of size V × D into the product of two matrices of sizes V × H and H × D, respectively. In this way, the parameter number of the embedding matrices could be significantly reduced as long as H is to be much smaller than D.

The Final Objective
Combing all the above components, the final training loss of our model is defined as: where λ and γ controls the relative importance of each component of the loss function.

Decoder with a Copy Mechanism
To enforce a generated sentence to stick to the words presented in the table as much as possible, we follow (See et al., 2017) to employ a copy mechanism when generating an output sequence. Specifically, let P vocab be the output of the Transformer decoder. P vocab is a discrete distribution over the vocabulary words and denotes the probabilities of generating the next word. The standard methods typically generate the next word by directly sampling from P vocab . In the copy mechanism, we instead generate the next word y i with the following discrete distribution: is the probability of switching sampling between P vocab and P att , with learnable parameters (W 1 , b 1 ) and h i as the hidden state from the Transformer decoder for the i-th word. P att is the attention weights (probability) returned from the encoder-decoder attention module in the Transformer. Specifically, when generating the current word y i , the encoder-decoder attention module calculates the probability vector P att denoting the probabilities of attending to each word in the input table. Note that the probabilities of the words not presented in the table are set to zero.

Experiments
We conduct experiments to verify the effectiveness and superiority of our proposed approach against related methods.

Dataset
Our model is evaluated on the large-scale knowledge-base Wikiperson dataset released by  ) whose reference text only contains one-sentence descriptions, this dataset contains multiple sentences for each table to cover as many facts encoded in the input structured knowledge base as possible.

Evaluation Metrics
For automatic evaluation, we apply the widely used evaluation metrics including the standard BLEU-4 (Papineni et al., 2002), METEOR  (Denkowski and Lavie, 2014) and ROUGE (Lin, 2004) scores to evaluate the generation quality.
Since these metrics rely solely on the reference texts, they usually show poor correlations with human judgments when the references deviate too much from the table. To this end, we also apply the PARENT (Dhingra et al., 2019) metric that considers both the reference texts and table content in evaluations. To evaluate the faithfulness of the generated texts, we further modify the PARENT metric to measure the level of matching between generated texts and the corresponding tables. We denote this new metric as PARENT-T. Please see Appendix A for details. Note that the precision in PARENT-T corresponds to the percentage of words in a text sequence that co-occur in the table; and the recall corresponds to the percentage of words in a table that co-occur in the text.

Baseline Models
We compare our model with several strong baselines, including • The vanilla Seq2Seq attention model (Bahdanau et al., 2015).
• The method in : The stateof-art model on the Wikiperson dataset.
• The method in (Liu et al., 2018): The stateof-the-art method on the WikiBio dataset.
• The pointer-generator (See et al., 2017): A Seq2Seq model with attention, copying and coverage mechanism.

Implementation Details
Our implementation is based on OpenNMT (Klein et al., 2017). We train our models end-to-end to minimize our objective function with/without the copy mechanism. The vocabulary is limited to   Table 3: Ablation study of our model components. means the corresponding column component is used. means do not use the corresponding column component. Specifically, "Copy" means using copy mechanism, "EF" means using embedding factorization, "OT" means using optimal transport constraint loss, "N" means extracting nouns from both the table and text, and "W" means using the whole table and text to compute OT. Lastly, "latent" means using latent similarity loss.
the 50, 000 most common words in the training dataset. The hidden units of the multi-head component and the feed-forward layer are set to 2048. The baseline embedding size is 512. Following (Lan et al., 2019), the embedding size with embedding factorization is set to be 128. The number of heads is set to 8, and the number of Transformer blocks is 3. Beam size is set to be 5. Label smoothing is set to 0.1.
For the optimal-transport based regularizer, we first train the model without OT for about 20,000 steps, then fine tune the network with OT for about 10,000 steps. We use the Adam (Kingma and Ba, 2015) optimizer to train the models. We set the hyper-parameters of Adam optimizer accordingly, including the learning rate α = 0.00001, and the two momentum parameters, batch size = 4096 (tokens) and β 2 = 0.998.

Results
Table 1 and 2 show the experiment results in terms of different evaluation metrics compared with different baselines. "Ours" means our proposed model with components of copy mechanism, embedding factorization, OT-matching with nouns, and latent similarity loss 1 . We can see that our model outperforms existing models in all of the automatic evaluation scores, indicating high quality of the generated texts. The superiority of the PARENT-T scores (in terms of precision and recall) indicates that the generated text from our model is more faithful than others. Example out-Precision Recall F-1 measure Fluency Grammar  76.    Figure 4. The blue color indicates the corresponding row appears in the input table, but not in the output generation text. The red color indicates that these entities appear in the text but do not appear in the input table.
puts from different models are shown in Table 5 with an input table shown in Figure 4. In this example, our model covers all the entities in the input, while all other models miss some entities. Furthermore, other models hallucinate some information that does not appear in the input, while our model generates almost no extra information other than that in the input. These results indicate the faithfulness of our model. More examples are shown in Appendix E.

Ablation Study
We also conduct extensive ablation studies to better understand each component of our model, including the copy mechanism, embedding factorization, optimal transport constraint loss, and latent similarity loss. Table 3 shows the results in different evaluation metrics.
Effect of copy mechanism The first and second rows in Table 3 demonstrate the impacts of the copy mechanism. It is observed that with the copy mechanism, one can significantly improve the performance in all of the automatic metrics, especially on the faithfulness reflected by the PARENT-T score.

Effect of embedding factorization
We compare our model with the one without embedding factorization. The comparisons are shown in the second and third rows of Table 3. We can see that with embedding factorization, around half of the parameters can be reduced, while comparable performance can still be maintained.

Effect of table-text embedding similarity loss
We also test the model by removing the table-text embedding similarity loss component. The third and fourth rows in Table 3 summarize the results. With the table-text embedding similarity loss, the BLEU and METEOR scores drop a little, but the PARENT and PARENT-T scores improve over the model without the loss. This is reasonable because the loss aims at improving faithfulness of generated texts, reflected by the PARENT-T score.
Effect of the OT constraint loss We further compare the performance of the model (a) without using OT loss, (b) with using the whole table and text to compute OT, and (c) with using the extracted nouns from both table and text to compute OT. Results are presented in the third, fifth, and sixth rows of Table 3, respectively. The model with the OT loss improve performance on almost all scores, especially on the PARENT-T score. Furthermore, with only using the nouns to compute the OT loss, one can obtain even better results. These results demonstrate the effectiveness of the proposed OT loss on enforcing the model to be faithful to the original table.

Human Evaluation
Following Tian et al., 2019), we conduct extensive human evaluation on the generated descriptions and compare the results to the state-of-the-art methods. We design our evaluation criteria based on Tian et al., 2019), but our criteria differs from (Tian et al., 2019) in several aspects. Specifically, for each group of generated texts, we ask the human raters to evaluate the grammar, fluency, and faithfulness. The human evaluation metrics of faithfulness is defined in terms of precision, recall and F1-score with respect to the reconstructed Knowledge-base  Table 4 clearly show the superiority of our proposed method. Table-to-text generation has been widely studied, and Seq2Seq models have achieved promising performance. (Lebret et al., 2016b;Liu et al., 2018;Wiseman et al., 2018;Ma et al., 2019;Liu et al., 2019a). For Transformer-based methods, the Seq2Seq Transformer is used by Ma et al. (2019) for table-to-text generation in lowresource scenario. Thus, instead of encoding an entire table as in our approach, only the predicted key facts are encoded in (Ma et al., 2019). Extended transformer has been applied to game summary (Gong et al., 2019) and E2E NLG tasks (Gehrmann et al., 2018). However, their goals focus on matching the reference text instead of being faithful to the input.

Related Work
Another line of work attempts to use external knowledge to improve the quality of generated text (Chen et al., 2019b). These methods allow generation from an expanded external knowledge base that may contain information not relevant to the input table. Comparatively, our setting requires the generated text to be faithful to the input table. Nie et al. (2018) further study fidelitydata-to-text generation, where several executable symbolic operations are applied to guide text generation. Both models do not consider the matching between the input and generated output.
Regarding datasets, most previous methods are trained and evaluated on much simpler datasets like WikiBio (Lebret et al., 2016b) that contains only one sentence as a reference description. Instead, we focus on the more complicated structured knowledge base dataset  that aims to generate multi-sentence texts.  propose a model based on the pointer network that can copy facts directly from the input knowledge base. Our model uses a similar strategy but obtains much better performance.
In terms of faithfulness, one related parallel work is Tian et al. (2019). However, our method is completely different from theirs. Specifically, Tian et al. (2019) develop a confidence oriented decoder that assigns a confidence score to each target position to reduce the unfaithful information in the generated text. Comparatively, our method enforces faithfulness by including the proposed table-text optimal-transport matching loss and table-text embedding similarity loss. Moreover, the faithfulness of Tian et al. (2019) only requires generated texts to be supported by either a table or the reference; whereas ours constrains generated texts to be faithful only to the table.
Other related works are (Perez-Beltrachini and Lapata, 2018;Liu et al., 2019b). For (Perez-Beltrachini and Lapata, 2018), the content selection mechanism training with multi-task learning and reinforcement learning is proposed. For (Liu et al., 2019b), they propose force attention and reinforcement learning based method. Their learning methods are completely different from our method that simultaneously incorporates optimaltransport matching loss and embedding similarity loss. Moreover, the REINFORCE algorithm (Williams, 1992) and policy gradient method used in (Perez-Beltrachini and Lapata, 2018;Liu et al., 2019b) exhibits high variance when training the model.
Finally, the content-matching constraints between text and table is inspired by ideas in machine translation  and Seq2Seq models (Chen et al., 2019a).

Conclusion
In this paper, we propose a novel Transformerbased table-to-text generation framework to address the faithful text-generation problem. To enforce faithful generation, we propose a new tabletext optimal-transport matching loss and a tabletext embedding similarity loss. To evaluate the faithfulness of the generated texts, we further propose a new automatic evaluation metric specialized to the table-to-text generation problem. Extensive experiments are conducted to verify the proposed method. Both automatic and human evaluations show that our framework can significantly outperform the state-of-the-art methods. The natural language toolkit. In https://arxiv.org/abs/cs/0205028.

A PARENT-T Metric
PARENT-T evaluates each instance (T i , G i ) separately, by computing the precision and recall of generated text G i against table T i . In other words, PARENT-T is a table-focused version of PARENT (Dhingra et al., 2019).
When computing precision, we want to check what fraction of the n-grams in G i n are correct. We consider an n-gram g to be correct if it has a high probability of being entailed by the table. We use the word overlap model for entailment probability w(g). The precision score E p for one instance is computed as follows: whereT i denotes all the lexical items present in the table T i , n is the length of g, and g j is the jth token in g. w(g) is the entailment probability, and E n p is the entailed precision score for n-grams of order n. # G i n (g) denotes the count of n-gram g in G i n . The precision score E p is a combination of n-gram orders 1-4 using a geometric average.
For recall, we only compute it against table to ensure that texts that mention more information from the table get higher scores. E r (T i ) is computed in the same way as in Dhingra et al. (2019): where a table is a set of records T i = {r k } K k=1 , r k denotes the value string of record r k , and LCS(x, y) is the length of the longest common subsequence between x and y. Higher values of E r (T i ) denote that more records are likely to be mentioned in G i .
Thus, the PARENT-T score (i.e. F score) for one instance is: The system-level PARENT-T score for a model M is the average of instance-level PARENT-T scores across the evaluation set.

B Details of Human Evaluation
The following are the details for instructing our human evaluation raters how to rate each generated sentence: We only provide the input table and the generated text for the raters. There are 20 well-trained raters participating in the evaluation.
Fluency : 4: The sentence meaning is clear and flow naturally and smoothly.
3: The sentence meaning is clear, but there are a few interruptions.
2: The sentence does not flow smoothly but people can understand its meaning.
1: The sentence is not fluent at all and people cannot understand its meaning.
Grammar : 4 : There are no grammar errors. 3: There are a few grammar errors, but sentence meaning is clear.
2: There are some grammar errors, but not influencing its meaning.
1: There are many grammar errors. People cannot understand the sentence meaning.
Faithfulness A sentence is faithful if it contains only information supported by the table. It should not contain additional information other than the information provided by the table or inferred from the table. Also, the generated sentence should cover as much information in the given table as possible. The raters first manually extract entities from the generated sentences and then calculate the precision as the percentage of entities in the generated text also appear in the table; calculate the recall as the percentage of entities in the table also appear in the generated text. For each tabletext pair, its F-1 score is then calculated according to the precision and recall.

C IPOT algorithm
Given a pair of table and its corresponding text description, we can obtain table words embedding as S = {x i } i=n i=1 , and the model output for sentence words embedding as S = {y j } j=m j=1 . The cost matrix C is then computed as in Section 2.4. Both S and S are used as inputs to the IPOT algorithm in Algorithm 1 to obtain the OT-matching distance.
Algorithm 1 IPOT algorithm. Figure 5 illustrates three matching cases from top to bottom, namely hard matching, soft bipartite matching, and optimal transport matching. The hard matching stands for exactly matching words between the table and the target sequences. This operation is non-differentiable. The soft bipartite matching, on the other hand, supposes the similarity between the word embedding v i k and v j k is d(v i k , v j k ), and finds the matching such that

D Details of Optimal Transport Loss
is minimized. This minimization can be solved exactly by the Hungarian algorithm (Kuhn, 1955). But, its objective is still non-differentiable. Our proposed optimal transport matching can be viewed as the relaxed problem of the soft bipartite matching by computing the distance between the distribution over the input table and the decoded text sentence. This distance in optimal transport matching is differentiable.

E More generation examples
More generation examples from different models are shown in Figure 6, 7, and Table 6, 7. Specifically, Table 7 and Figure 7 show a more challenging example, as its table has 22 rows. In this example, we can observe that all the RNN-based models cannot capture such long term dependencies and miss most of the input records in the table. By contrast, our model miss much less input records.

Model
Miss Generated texts     Figure 6. The "Miss" column indicates the corresponding row appears in the input table, but does not appear in the output generation text. The red color indicates that these entities appear in the text but do not appear in the input table.
Pointer generator 2, 4, 5, 6, 7, 8, 10, 11, 12, 13, 14, 16, 16, 20É mile Mbouh, (born 30 May 1966) is a Cameroon retired Association football who played as a Midfielder . he played for Cameroon national football team in the 1990 FIFA World Cup . he also played for Perlis FA and Liaoning Whowin F.C. .Émile was born in Douala, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 20É mile Mbouh, (born 30 May 1966) is a retired Cameroonian Association football who played as a Midfielder . he was born in Douala . he was a member of the Cameroon national football team at the 1990 FIFA World Cup . he was a member of the Cameroon national football team at the 1990 FIFA World Cup . he was a member of the Cameroon national football team at the 1990 FIFA World Cup . he was a member of the Cameroon national football team at the 1990 FIFA World Cup . 3,4,5,6,8,10,11,12,13,14,15,16,17 Figure 7. The "Miss" column indicates the corresponding row appears in the input table, but does not appear in the output generation text. The red color indicates that these entities appear in the text but do not appear in the input table.